AIOps Architect
GSPANN
All India, Hyderabad • 1 month ago
Experience: 3 to 7 Yrs
Job Description
As an AIOps Architect at GSPANN, your role involves designing and leading enterprise AIOps architecture to enhance observability, incident management, automation, and autonomous remediation. You will be responsible for integrating Machine Learning, IT Service Management (ITSM) platforms, and Site Reliability Engineering (SRE) practices to develop scalable, self-healing operational ecosystems.
**Role Overview:**
In this role, you will:
- Design and implement end-to-end AIOps architecture covering observability, incident lifecycle management, anomaly detection, root cause analysis (RCA), and autonomous remediation.
- Define and maintain AIOps strategy, reference architecture, governance frameworks, and operational blueprints.
- Architect agentic and generative AI patterns for IT Operations, Data Operations, and Platform Operations.
- Design unified observability frameworks spanning logs, metrics, traces, alerts, and events.
- Build scalable event ingestion, correlation, and anomaly detection pipelines using ML and AI models.
- Develop confidence-scored, controlled auto-remediation and auto-correction workflows.
- Create orchestration layers to enable self-healing infrastructure, applications, and data pipelines.
- Integrate runbooks, standard operating procedures (SOPs), and AI-driven virtual agents to automate L1 and L2 operations.
- Integrate AIOps platforms with ITSM tools (ServiceNow, Jira), Configuration Management Database (CMDB), asset inventory systems, and enterprise observability stacks.
- Collaborate with Data Engineering, Cloud, Infrastructure, Security, and Application teams to operationalize AIOps capabilities.
- Align AIOps solutions with SRE principles, ITSM processes, and enterprise reliability objectives.
- Serve as a trusted technical advisor to leadership, contributing to roadmaps, QBRs, and transformation programs.
- Mentor engineering and operations teams on AIOps architecture, automation, and observability best practices.
- Drive adoption through documentation, playbooks, governance models, and enablement initiatives.
**Qualifications Required:**
- 10+ years of experience in IT Operations, SRE, DevOps, or related domains, including 3 to 5 years architecting AIOps solutions.
- Hold certifications in Cloud (AWS/Azure/GCP), SRE, ITIL, Observability platforms, or related disciplines (preferred).
- Demonstrate strong hands-on experience with AIOps platforms, observability stacks, and monitoring ecosystems such as Prometheus, Grafana, Elastic, Dynatrace, or New Relic.
- Possess experience operationalizing ML models through MLOps practices.
- Apply deep understanding of logs, metrics, traces, event pipelines, and distributed system architectures.
- Design and implement Machine Learning, Generative AI, and agent-based architectures for operational automation.
- Build anomaly detection models, predictive alerting systems, and advanced RCA frameworks.
- Develop event correlation engines and automated remediation workflows.
- Possess strong expertise in infrastructure (compute, storage, networking), cloud platforms (AWS, Azure, GCP), and Kubernetes ecosystems.
- Apply DevOps, CI/CD, SRE, and automation frameworks in enterprise environments.
- Integrate AIOps capabilities with ITSM and CMDB platforms using enterprise integration patterns.
- Design scalable, resilient, modular, and secure AIOps architectures.
- Create reference architectures, governance models, and operational blueprints for enterprise adoption.
- Apply data engineering principles, including ETL/ELT pipelines and data quality frameworks.
- Lead engineering transformation initiatives and drive operational excellence across complex environments.
- Demonstrate strong analytical thinking, problem-solving, and stakeholder management skills.
Skills Required
Machine Learning
IT Service Management
Anomaly Detection
Root Cause Analysis
Automation
Metrics
Alerts
ML
Orchestration
SOPs
CMDB
Kubernetes
DevOps
Stakeholder Management
AIOps
Site Reliability Engineering
Observability
AI Patterns
Logs
Traces
Events
AI Models
Auto-remediation
Runbooks
Virtual Agents
ITSM Tools
Asset Inventory Systems
SRE Principles
Cloud Platforms
CI/CD
ETL/ELT Pipelines
Data Quality Frameworks
Posted on: March 28, 2026