AI Engineer - Autonomous Agents & Model Infrastructure
IQuest Solutions Corporation
All India • 2 months ago
Experience: 2 to 6 Yrs
PREMIUM
Deal of the Day
--:--:--
15 Days Free Trial
After Free Trial → Flat 50% OFF
Upgrade to CVX24 Premium
- Free Resume Writing
-
Get a Verified Blue tick
- See who viewed your profile
- Unlimited chat with recruiters
- Rank higher in recruiter searches
- Get up to 10× more recruiter visibility
- Auto-forward profile to 10 top recruiters
- Receive verified recruiter messages directly
- Unlock hidden jobs, not visible to free users
$0
Activate
$0
A small token amount will be charged to verify.
Get Refund in 48 Hours.
Free Earplugs Delivery Only after Payment of Rs. 99 for Five Consecutive Months.
After free-trial 6 Months subscription will be auto Activated @ $
1
(Cancel Anytime). Quoted price includes 50% discount.
Enter Your Details
Job Description
As an experienced AI Engineer, you will be responsible for designing, deploying, and managing autonomous agent systems on proprietary infrastructure. Your role will involve owning the full lifecycle of the process, from optimizing model weights to building production-grade agents with fine-tuning and reinforcement learning on on-premises or private cloud environments.
Key Responsibilities:
- Design and deploy autonomous agent architectures on AWS VPC and on-premise environments
- Manage model weights and optimize for inference; implement LoRA and QLoRA fine-tuning for domain-specific tasks
- Develop reinforcement learning pipelines for agent training with reward modeling and policy optimization
- Implement MLOps/LLMOps infrastructure: model versioning, A/B testing, rollbacks, and evaluation frameworks
- Architect RAG systems integrating vector databases with proprietary and fine-tuned models
- Optimize model serving infrastructure (vLLM, TorchServe, TensorRT) for production inference
- Build monitoring and observability systems for agent behavior and RL training quality
- Ensure model security, data privacy, and audit compliance in enterprise deployments
Required Qualifications:
- 3-4 years hands-on experience in AI/ML/Data Science with at least 2 projects shipped to production
- 2+ years dedicated experience in MLOps, LLMOps, or AIops (model deployment, inference optimization, pipeline automation, model management)
- AWS proficiency across AI services: EC2, VPC, S3, IAM, SageMaker, Bedrock, Lambda, or custom ML infrastructure
- Strong software engineering fundamentals: containerization (Docker), orchestration (Kubernetes), CI/CD, and API design
- Hands-on experience deploying and serving large language models or foundation models in production environments
- Practical experience with LoRA and QLoRA fine-tuning techniques for efficient model adaptation
- Understanding of reinforcement learning fundamentals and experience implementing RL-based training
- Working knowledge of vector databases and RAG implementation
- Solid understanding of model optimization techniques and inference constraints
Preferred Qualifications:
- Experience building autonomous agents with RL frameworks and fine-tuning frameworks
- QLoRA experience on consumer-grade GPUs in memory-constrained environments
- Migration experience from cloud APIs to self-hosted models
- On-premises or VPC-only deployment experience
- Familiarity with agent frameworks and MLOps tools
- Strong debugging and systems thinking approach with evidence-based problem-solving
The company values pragmatic engineers who balance performance and cost, strong debuggers with evidence-based approaches, clear communicators, and owners who see projects end-to-end.
You will work with technologies such as AWS, Python, Docker, Kubernetes, LLMs/Foundation Models, LoRA/QLoRA Libraries, Reinforcement Learning Frameworks, Vector Databases, Agent Frameworks, Model Serving Technologies, Fine-tuning Tools, and CI/CD Pipelines. As an experienced AI Engineer, you will be responsible for designing, deploying, and managing autonomous agent systems on proprietary infrastructure. Your role will involve owning the full lifecycle of the process, from optimizing model weights to building production-grade agents with fine-tuning and reinforcement learning on on-premises or private cloud environments.
Key Responsibilities:
- Design and deploy autonomous agent architectures on AWS VPC and on-premise environments
- Manage model weights and optimize for inference; implement LoRA and QLoRA fine-tuning for domain-specific tasks
- Develop reinforcement learning pipelines for agent training with reward modeling and policy optimization
- Implement MLOps/LLMOps infrastructure: model versioning, A/B testing, rollbacks, and evaluation frameworks
- Architect RAG systems integrating vector databases with proprietary and fine-tuned models
- Optimize model serving infrastructure (vLLM, TorchServe, TensorRT) for production inference
- Build monitoring and observability systems for agent behavior and RL training quality
- Ensure model security, data privacy, and audit compliance in enterprise deployments
Required Qualifications:
- 3-4 years hands-on experience in AI/ML/Data Science with at least 2 projects shipped to production
- 2+ years dedicated experience in MLOps, LLMOps, or AIops (model deployment, inference optimization, pipeline automation, model management)
- AWS proficiency across AI services: EC2, VPC, S3, IAM, SageMaker, Bedrock, Lambda, or custom ML infrastructure
- Strong software engineering fundamentals: containerization (Docker), orchestration (Kubernetes), CI/CD, and API design
- Hands-on experience deploying and serving large language models or foundation models in production environments
- Practical experience with LoRA and QLoRA fine-tuning techniques for efficient model adaptation
- Understanding of reinforcement learning fundamentals and experience implementing RL-based training
Skills Required
Posted on: March 16, 2026
Relevant Jobs
Step 2 of 2