Freelance AI Evaluation Engineer (Python/Full-Stack) (Hyderabad)

Mindrift

All India, Hyderabad • 1 month ago

Experience: 5 to 9 Yrs

Job Description

You will be connecting specialists with project-based AI opportunities for leading tech companies through Mindrift. The focus will be on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

Key Responsibilities:
- Review and refine realistic coding tasks based on provided production codebases, with realistic scope, requirements, and information sources
- Write comprehensive functional tests that validate actual end-to-end behavior and edge cases, not just superficial checks
- Craft "fair but hard" challenges where the AI has all the context it needs but has to work for it (information scattered across files and external sources, complex reasoning required)
- Analyze AI failures to understand what the model struggles with versus what it masters
- Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria

Qualifications Required:
- Degree in Computer Science, Software Engineering, or related fields
- 5+ years in software development, primarily in Python (pytest, async/await, subprocess, file operations)
- Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems
- Experience writing tests (functional, integration), not just running them
- Familiarity with Docker containers (running evaluations locally in containers)
- Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results)
- English proficiency at level B2

This opportunity is suitable for experienced developers, software engineers, and test automation specialists who are open to part-time, non-permanent projects.

The compensation for this project can go up to $12 per hour equivalent, depending on the level and pace of your contribution. The effort estimate for tasks in this project is approximately 20 hours, depending on complexity. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted.

Please note that compensation may vary across projects based on scope, complexity, and required expertise. Other projects on the platform may offer different earning levels based on their specific requirements.

Posted on: March 18, 2026
