Sr. Data Engineer
Abacus Insights
All India • 2 months ago
Experience: 5 to 9 Yrs
PREMIUM
Deal of the Day
--:--:--
15 Days Free Trial
After Free Trial → Flat 50% OFF
Upgrade to CVX24 Premium
- Free Resume Writing
-
Get a Verified Blue tick
- See who viewed your profile
- Unlimited chat with recruiters
- Rank higher in recruiter searches
- Get up to 10× more recruiter visibility
- Auto-forward profile to 10 top recruiters
- Receive verified recruiter messages directly
- Unlock hidden jobs, not visible to free users
$0
Activate
$0
A small token amount will be charged to verify.
Get Refund in 48 Hours.
Free Earplugs Delivery Only after Payment of Rs. 99 for Five Consecutive Months.
After free-trial 6 Months subscription will be auto Activated @ $
1
(Cancel Anytime). Quoted price includes 50% discount.
Enter Your Details
Job Description
Role Overview:
Abacus Insights is looking for an accomplished Data Engineer to join their dynamic and rapidly expanding Tech Ops division. In this role, you will have the opportunity to work directly with customers, data vendors, and internal engineering teams to design, implement, and optimize complex data integration solutions within a modern, large-scale cloud environment. Your primary responsibilities will include leveraging advanced skills in distributed computing, data architecture, and cloud-native engineering to enable scalable, resilient, and high-performance data ingestion and transformation pipelines. As a trusted technical advisor, you will guide customers in adopting Abacuss core data management platform and ensure high-quality, compliant data operations across the lifecycle.
Key Responsibilities:
- Architect, design, and implement high-volume batch and real-time data pipelines using PySpark, SparkSQL, Databricks Workflows, and distributed processing frameworks.
- Build end-to-end ingestion frameworks integrating with Databricks, Snowflake, AWS services (S3, SQS, Lambda), and vendor data APIs, ensuring data quality, lineage, and schema evolution.
- Develop data modeling frameworks, including star/snowflake schemas and optimization techniques for analytical workloads on cloud data warehouses.
- Lead technical solution design for health plan clients, creating highly available, fault-tolerant architectures across multi-account AWS environments.
- Translate complex business requirements into detailed technical specifications, engineering artifacts, and reusable components.
- Implement security automation, including RBAC, encryption at rest/in transit, PHI handling, tokenization, auditing, and compliance with HIPAA and SOC 2 frameworks.
- Establish and enforce data engineering best practices, such as CI/CD for data pipelines, code versioning, automated testing, orchestration, logging, and observability patterns.
- Conduct performance profiling and optimize compute costs, cluster configurations, partitions, indexing, and caching strategies across Databricks and Snowflake environments.
- Produce high-quality technical documentation including runbooks, architecture diagrams, and operational standards.
- Mentor junior engineers through technical reviews, coaching, and training sessions for both internal teams and clients.
Qualifications Required:
- Bachelors degree in Computer Science, Computer Engineering, or a closely related technical field.
- 5+ years of hands-on experience as a Data Engineer working with large-scale, distributed data processing systems in modern cloud environments.
- Working knowledge of U.S. healthcare data domainsincluding claims, eligibility, and provider datasetsand experience applying this knowledge to complex ingestion and transformation workflows.
- Strong ability to communicate complex technical concepts clearly across both technical and non-technical stakeholders.
- Expert-level proficiency in Python, SQL, and PySpark, including developing distributed data transformations and performance-optimized queries.
- Demonstrated experience designing, building, and operating production-grade ETL/ELT pipelines using Databricks, Airflow, or similar orchestration and workflow automation tools.
- Proven experience architecting or operating large-scale data platforms using dbt, Kafka, Delta Lake, and event-driven/streaming architectures, within a cloud-native data services or platform engineering environmentrequiring specialized knowledge of distributed systems, scalable data pipelines, and cloud-scale data processing.
- Experience working with structured and semi-structured data formats such as Parquet, ORC, JSON, and Avro, including schema evolution and optimization techniques.
- Strong working knowledge of AWS data ecosystem componentsincluding S3, SQS, Lambda, Glue, IAMor equivalent cloud technologies supporting high-volume data engineering workloads.
- Proficiency with Terraform, infrastructure-as-code methodologies, and modern CI/CD pipelines (e.g., GitLab) supporting automated deployment and versioning of data systems.
- Deep expertise in SQL and compute optimization strategies, including Z-Ordering, clustering, partitioning, pruning, and caching for large-scale analytical and operational workloads.
- Hands-on experience with major cloud data warehouse platforms such as Snowflake (preferred), BigQuery, or Redshift, including performance tuning and data modeling for analytical environments. Role Overview:
Abacus Insights is looking for an accomplished Data Engineer to join their dynamic and rapidly expanding Tech Ops division. In this role, you will have the opportunity to work directly with customers, data vendors, and internal engineering teams to design, implement, and optimize complex data integration solutions within a modern, large-scale cloud environment. Your primary responsibilities will include leveraging advanced skills in distributed computing, data architectu
Skills Required
Posted on: March 11, 2026
Relevant Jobs
Step 2 of 2