Lead GCP DevOps Engineer
Fegmo
All India, Noida • 1 month ago
Experience: 7 to 12 Yrs
PREMIUM
Deal of the Day
--:--:--
7 Days Free Trial
Upgrade to CVX24 Premium
- Free Resume Writing
-
Get a Verified Blue tick
- See who viewed your profile
- Unlimited chat with recruiters
- Rank higher in recruiter searches
- Get up to 10× more recruiter visibility
- Auto-forward profile to 10 top recruiters
- Receive verified recruiter messages directly
- Unlock hidden jobs, not visible to free users
$0
Activate
$0
A small token amount will be charged to verify.
Get Refund in 48 Hours.
After free-trial 6 Months subscription will be auto Activated @ $
1
(Cancel Anytime).
Free Earplugs Delivery Only after Payment of Rs. 99 for Five Consecutive Months.
Enter Your Details
Job Description
**Job Description**
**Role Overview:**
As a Lead DevOps Engineer at Fegmo, you will be responsible for ensuring the reliability and operational excellence of the Fegmo platform. Your main focus will be on making releases predictable, environments stable, and production observable, secure, and cost-efficient. You will work closely with engineering leadership, backend/front-end teams, and AI engineers in Noida.
**Key Responsibilities:**
- **CI/CD and Release Engineering**
- Own and enhance GitHub Actions pipelines, build and deploy automation, and release readiness practices.
- Introduce practical release gates to reduce rollback and hotfix frequency.
- Improve deployment safety and reduce build times.
- **Cloud Infrastructure and Environments (GCP)**
- Manage deployments and runtime reliability on Google Cloud, ensuring environment consistency.
- Enhance IAM hygiene, secrets handling, and access patterns.
- Optimize Docker images and runtime performance.
- **Observability, Incident Response, and Reliability**
- Implement logging, metrics, alerting, and dashboards for core services.
- Establish on-call and incident practices to improve stability.
- Define reliability targets and operational checks.
- **Cost and Performance Optimization**
- Monitor and optimize cloud spend.
- Improve performance bottlenecks and scaling strategies.
- **Platform Security Basics**
- Enforce secure defaults and secure configuration practices.
- Partner with engineering teams to ensure secure deployment patterns.
- **Support for AI and Data Workloads (light MLOps)**
- Support AI services with reliable deployments and monitoring.
- Enable safe experimentation without affecting production environments.
**Required Skills and Experience:**
- 712+ years in DevOps, SRE, or platform engineering roles.
- Strong experience with CI/CD, Docker, Linux, and GCP.
- Proven track record in improving release reliability and incident response practices.
- Strong fundamentals in networking, security hygiene, and access controls.
- Experience collaborating with engineers to drive adoption.
**Nice-to-Have:**
- Infrastructure as code, Kubernetes, GitOps practices.
- Experience with AI/ML workloads and integration-heavy SaaS platforms.
- Database operations and performance tuning experience.
**Why Join Fegmo **
- Own platform reliability and delivery velocity at an early-stage AI-native company.
- Build the operational foundation for fast product iteration.
- High ownership and impact opportunities.
**Additional Details:**
Fegmo is a mission-driven team valuing collaboration, creativity, and inclusion. Applicants from all backgrounds are welcome, especially those with unique perspectives and a passion for building meaningful tools. **Job Description**
**Role Overview:**
As a Lead DevOps Engineer at Fegmo, you will be responsible for ensuring the reliability and operational excellence of the Fegmo platform. Your main focus will be on making releases predictable, environments stable, and production observable, secure, and cost-efficient. You will work closely with engineering leadership, backend/front-end teams, and AI engineers in Noida.
**Key Responsibilities:**
- **CI/CD and Release Engineering**
- Own and enhance GitHub Actions pipelines, build and deploy automation, and release readiness practices.
- Introduce practical release gates to reduce rollback and hotfix frequency.
- Improve deployment safety and reduce build times.
- **Cloud Infrastructure and Environments (GCP)**
- Manage deployments and runtime reliability on Google Cloud, ensuring environment consistency.
- Enhance IAM hygiene, secrets handling, and access patterns.
- Optimize Docker images and runtime performance.
- **Observability, Incident Response, and Reliability**
- Implement logging, metrics, alerting, and dashboards for core services.
- Establish on-call and incident practices to improve stability.
- Define reliability targets and operational checks.
- **Cost and Performance Optimization**
- Monitor and optimize cloud spend.
- Improve performance bottlenecks and scaling strategies.
- **Platform Security Basics**
- Enforce secure defaults and secure configuration practices.
- Partner with engineering teams to ensure secure deployment patterns.
- **Support for AI and Data Workloads (light MLOps)**
- Support AI services with reliable deployments and monitoring.
- Enable safe experimentation without affecting production environments.
**Required Skills and Experience:**
- 712+ years in DevOps, SRE, or platform engineering roles.
- Strong experience with CI/CD, Docker, Linux, and GCP.
- Proven track record in improving release reliability and incident response practices.
- Strong fundamentals in networking, security hygiene, and access controls.
- Experience collaborating with engineers to drive adoption.
**Nice-to-Have:**
- Infrastructure as code, Kubernetes, GitOps practices.
- Experience with AI/ML
Skills Required
Posted on: March 28, 2026
Relevant Jobs
Step 2 of 2