Principal Site Reliability Engineering/DevOps
Cvent, Inc.
All India, Gurugram • 2 months ago
Experience: 10 to 14 Yrs
Job Description
As a Principal Site Reliability Engineer at Cvent, you will play a crucial role in scaling systems, ensuring stability, reliability, and performance, and enabling rapid deployments of the platform. You will be at the forefront of adopting emerging technologies and processes to improve software deployment, strengthen security, mitigate risks, and enhance the overall end-user experience. Your contributions will be instrumental in advancing DevOps maturity and fostering a culture of quality and site reliability within the Technology R&D Team.
**Key Responsibilities:**
- Continuously assess and integrate emerging cloud and AI/automation technologies
- Lead the design and implementation of CI/CD, containerization, and IaC for large-scale environments
- Establish observability, monitoring, and alerting strategies using tools like Datadog, Prometheus, Grafana, and ELK
- Lead capacity planning, cost optimization, and disaster recovery efforts
- Translate business risk and product goals into actionable reliability strategies
- Mentor and upskill SRE/DevOps engineers
- Pioneer the use of AI-powered automation for alert triage and workflow efficiencies
- Represent technology priorities to leadership and stakeholders
**Qualifications Required:**
- 10-13 years of experience in SRE, cloud engineering, or DevOps with significant time in an architect, staff, or principal role
- Proficiency in AWS, distributed systems architecture, and infrastructure as code
- Track record in driving adoption of AI, automation, and ML for operational efficiency
- Strong programming/scripting skills with expertise in Python, Go, or similar languages
- In-depth knowledge of Linux internals and troubleshooting distributed systems
- Experience in networking, cloud, databases, and scripting with multi-tier architectures
- Excellent communication and coaching skills across engineering and product teams
- Mastery of incident management, postmortem culture, and root cause analysis
- Deep understanding of Unix/Linux environments and system internals
- Ability to develop solutions based on multiple technologies
Apply now for the opportunity to shape the future of site reliability engineering at Cvent and drive innovation in a dynamic and collaborative environment.
Posted on: March 5, 2026