Data Scientist / Analytics Engineer
LinkedIn Corporation
Eluru, All India • 2 months ago
Experience: 5 to 9 Yrs
Job Description
As a senior Databricks Architect, you will be responsible for designing, building, and governing the Lakehouse data platform. Your primary focus will be on driving the adoption of Databricks, Unity Catalog, and modern Lakehouse patterns across all data products and pipelines. Your key responsibilities will include:
- Designing and implementing a production-grade Medallion Architecture (Bronze / Silver / Gold) across all data pipelines.
- Defining data modeling standards and schema evolution policies across the Lakehouse.
- Architecting end-to-end data flows from ingestion (streaming and batch) through transformation and serving layers.
You will also lead the setup, configuration, and rollout of Unity Catalog as the centralized governance layer for all data assets. This includes implementing fine-grained access control, data masking policies, audit logging, data lineage tracking, and data classification frameworks.
In addition, you will be responsible for building and maintaining production-grade data pipelines using PySpark, Delta Live Tables (DLT), and Databricks Workflows / Jobs. This will involve designing modular, reusable pipeline patterns, implementing robust pipeline observability, and leveraging Databricks Repos for CI/CD integration.
Furthermore, you will optimize Spark execution plans, identify and resolve performance bottlenecks, and right-size cluster configurations to ensure efficient processing of large-scale distributed workloads. You will also set up and maintain Databricks Repos with standardized project structures, define Python coding standards, and build reusable Python utility libraries for common patterns.
Qualifications Required:
- 5+ years of hands-on experience with Databricks, with at least 2 years in an architect or senior lead role.
- Deep expertise in Unity Catalog, Medallion Architecture, and Delta Lake.
- Proven experience designing and deploying production pipelines with Databricks Jobs and Workflows.
- Hands-on experience with Databricks Repos and CI/CD integration.
- Experience configuring and operating Serverless SQL Warehouses and Serverless compute for Jobs.
Additionally, you should have experience with DataFrames, Spark SQL, window functions, broadcast joins, UDFs, structured streaming, and micro-batch processing patterns. Advanced Python skills, familiarity with common data engineering libraries, cloud deployment experience, and relevant certifications are also required for this role.
You will also have the opportunity to partner with data engineers, data scientists, and analytics engineers to ensure the platform meets diverse workload needs, mentor the engineering team, and evaluate new Databricks features and third-party integrations relevant to the organization's data roadmap.
Skills Required
Python
Software Engineering
AWS
Elasticsearch
Apache Kafka
Databricks
Unity Catalog
Lakehouse patterns
PySpark
Delta Live Tables (DLT)
Databricks Workflows
Spark SQL
SQL Warehouses
Serverless compute
DataFrames
window functions
broadcast joins
UDFs
structured streaming
micro-batch processing
pandas
pydantic
Great Expectations
Terraform
dbt (data build tool)
Posted on: March 11, 2026