Gitlab Logo

Senior Site Reliability Engineer, Tenant Services Geo (Mumbai)

Gitlab

All India 3 to 7 Yrs 1 month ago

Job Description

As a Site Reliability Engineer (SRE) at GitLab, you will be responsible for ensuring the smooth operation of all user-facing services and production systems. You will be part of the Tenant Services, Geo team, which focuses on data replication and disaster recovery for GitLab Dedicated customers.

### Key Responsibilities:

  • Execute Dedicated Geo migrations and cutovers from planning to verification and cleanup
  • Participate in shift and weekend coverage for Dedicated cutovers and SaaS Site Reliability Engineering on-call rotation
  • Operate and enhance the Geo operational surface for Dedicated, including preparation, execution, and handling of escalations
  • Design, develop, and maintain automation, tooling, and runbooks to streamline migrations and cutovers
  • Utilize tools like Ansible, Chef, Terraform, GitLab CI/CD, and Kubernetes for infrastructure management
  • Collaborate with various teams to improve Geo features, migration planning, and reliability enhancements
  • Contribute to incident reviews and implement changes to automation, processes, or products
  • Document all actions for knowledge sharing and automation purposes
  • Proactively identify and automate repetitive operational tasks to reduce manual work

### Qualifications Required:

  • Prior experience in operating highly-available distributed systems
  • Familiarity with disaster recovery technologies or GitLab is a plus
  • Willingness to learn and adapt to GitLab's specific stack

Join GitLab to co-create the future by leveraging technology to revolutionize software development processes. Explore a high-performance culture where innovation thrives and every voice is valued. Accelerate your career with us and be a part of a team that embraces AI to drive efficiency, innovation, and impact. As a Site Reliability Engineer (SRE) at GitLab, you will be responsible for ensuring the smooth operation of all user-facing services and production systems. You will be part of the Tenant Services, Geo team, which focuses on data replication and disaster recovery for GitLab Dedicated customers.

### Key Responsibilities:

  • Execute Dedicated Geo migrations and cutovers from planning to verification and cleanup
  • Participate in shift and weekend coverage for Dedicated cutovers and SaaS Site Reliability Engineering on-call rotation
  • Operate and enhance the Geo operational surface for Dedicated, including preparation, execution, and handling of escalations
  • Design, develop, and maintain automation, tooling, and runbooks to streamline migrations and cutovers
  • Utilize tools like Ansible, Chef, Terraform, GitLab CI/CD, and Kubernetes for infrastructure management
  • Collaborate with various teams to improve Geo features, migration planning, and reliability enhancements
  • Contribute to incident reviews and implement changes to automation, processes, or products
  • Document all actions for knowledge sharing and automation purposes
  • Proactively identify and automate repetitive operational tasks to reduce manual work

### Qualifications Required:

  • Prior experience in operating highly-available distributed systems
  • Familiarity with disaster recovery technologies or GitLab is a plus
  • Willingness to learn and adapt to GitLab's specific stack

Join GitLab to co-create the future by leveraging technology to revolutionize software development processes. Explore a high-performance culture where innovation thrives and every voice is valued. Accelerate your career with us and be a part of a team that embraces AI to drive efficiency, innovation, and impact.

Posted on: April 12, 2026