Lead Reliability Engineer
Seclore
All India, Delhi • 2 months ago
Experience: 4 to 8 Yrs
PREMIUM
Deal of the Day
--:--:--
15 Days Free Trial
Upgrade to CVX24 Premium
- Free Resume Writing
-
Get a Verified Blue tick
- See who viewed your profile
- Unlimited chat with recruiters
- Rank higher in recruiter searches
- Get up to 10× more recruiter visibility
- Auto-forward profile to 10 top recruiters
- Receive verified recruiter messages directly
- Unlock hidden jobs, not visible to free users
$0
Activate
$0
A small token amount will be charged to verify.
Get Refund in 48 Hours.
After free-trial 6 Months subscription will be auto Activated @ $2.49 (Cancel Anytime).
Free Bluetooth earphones with 6 Months subscription only.
Enter Your Details
Job Description
As a Senior Site Reliability Engineer at Seclore, you will play a crucial role in designing, building, and operating highly available and scalable cloud infrastructure supporting Seclores data-centric security platform. Your responsibilities will include leading reliability engineering, performance optimization, monitoring, automation, and incident management across global cloud operations as the platform grows. Here's what you can expect in this role:
- Maintain cloud environments (AWS, Azure, GCP) for Seclores SaaS offerings with high availability, fault tolerance, and disaster recovery readiness.
- Drive SLOs, SLIs, and error-budget practices to improve reliability and reduce downtime and latency issues.
- Implement monitoring, logging, and tracing solutions (Prometheus, Grafana, ELK, OpenTelemetry) with robust alerting and runbooks.
- Automate infrastructure provisioning (IaC: Terraform, CloudFormation), CI/CD deployments, configuration, and operational workflows.
- Lead major incident responses, conduct blameless post-mortems, and drive long-term remediation.
- Monitor resource usage, conduct capacity planning, and optimize cloud costs while maintaining performance.
- Partner with security and product teams to ensure compliance, IAM best practices, network segmentation, and secure cloud architecture.
- Collaborate with engineering, product, QA, and support teams to embed SRE principles across the organization.
- Mentor junior team members and contribute to scaling a reliability-focused culture.
- Continuously evaluate new cloud tools, techniques, and services to enhance scalability and operational excellence.
Qualifications Required:
- 46+ years of experience in SRE / DevOps / Cloud Engineering roles (SaaS or product companies preferred).
- Strong expertise with AWS, Azure, or GCP (GCP is an added advantage).
- Hands-on experience with IaC tools (Terraform, CloudFormation) and container orchestration (Kubernetes, EKS/ECS).
- Proven experience in monitoring, alerting, tracing, and incident response in production environments.
- Deep understanding of SRE fundamentalsSLOs, SLIs, error budgets, resiliency, chaos testing.
- Strong scripting/automation abilities (Python, Bash, Go).
- Familiarity with configuration management systems (Ansible, Chef, Puppet).
- Excellent troubleshooting, analytical, and communication skills.
- Ability to mentor, guide, and influence engineering teams.
Additionally, Seclore values and supports those who take initiative and calculated risks, carrying a problem-solver attitude and a tech-agnostic aptitude. You will have the opportunity to work with smart minds in the business and thrive in an environment that focuses on outstanding employee experiences. If you are excited to be part of a team where you can build the future of data security, apply to become the next Entrepreneur at Seclore today! As a Senior Site Reliability Engineer at Seclore, you will play a crucial role in designing, building, and operating highly available and scalable cloud infrastructure supporting Seclores data-centric security platform. Your responsibilities will include leading reliability engineering, performance optimization, monitoring, automation, and incident management across global cloud operations as the platform grows. Here's what you can expect in this role:
- Maintain cloud environments (AWS, Azure, GCP) for Seclores SaaS offerings with high availability, fault tolerance, and disaster recovery readiness.
- Drive SLOs, SLIs, and error-budget practices to improve reliability and reduce downtime and latency issues.
- Implement monitoring, logging, and tracing solutions (Prometheus, Grafana, ELK, OpenTelemetry) with robust alerting and runbooks.
- Automate infrastructure provisioning (IaC: Terraform, CloudFormation), CI/CD deployments, configuration, and operational workflows.
- Lead major incident responses, conduct blameless post-mortems, and drive long-term remediation.
- Monitor resource usage, conduct capacity planning, and optimize cloud costs while maintaining performance.
- Partner with security and product teams to ensure compliance, IAM best practices, network segmentation, and secure cloud architecture.
- Collaborate with engineering, product, QA, and support teams to embed SRE principles across the organization.
- Mentor junior team members and contribute to scaling a reliability-focused culture.
- Continuously evaluate new cloud tools, techniques, and services to enhance scalability and operational excellence.
Qualifications Required:
- 46+ years of experience in SRE / DevOps / Cloud Engineering roles (SaaS or product companies preferred).
- Strong expertise with AWS, Azure, or GCP (GCP is an added advantage).
- Hands-on experience with IaC tools (Terraform, CloudFormation) and container orchestration (Kubernetes, EKS/ECS).
- Proven experience in monitoring, alerting, tracing, and incident response in production environments.
- Deep understanding of SRE fundamentalsSLOs,
Skills Required
Posted on: March 1, 2026
Relevant Jobs
Step 2 of 2