Freelance AI Evaluation Engineer (Python/Full-Stack) (Hyderabad)

Mindrift

All India, Hyderabad • 1 month ago

Experience: 5 to 9 Yrs

PREMIUM

Deal of the Day --:--:--

A recruiter messaged CVX24 Premium users few seconds ago.

Upgrade to CVX24 Premium: Only $2.49

Free Resume Writing
Get a Verified Blue tick
See who viewed your profile
Unlimited chat with recruiters
Rank higher in recruiter searches
Get up to 10× more recruiter visibility
Get practical interview tips and guidance
Receive verified recruiter messages directly
Unlock hidden jobs, not visible to free users

$4.99 $2.49 🔥 50% OFF

Activate

$4.99 $2.49 all inc.

🔥 50% OFF

(Validity: 6 Months. After payment confirmation we will reach out to you)

Job Description

You will be connecting specialists with project-based AI opportunities for leading tech companies through Mindrift. The focus will be on testing, evaluating, and improving AI systems. Participation in projects is on a project-based basis, not as permanent employment. Key Responsibilities: - Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources - Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks - Craft "fair but hard" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required) - Analyze AI failures to understand what the model struggles with versus what it masters - Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria Qualifications Required: - Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily in Python (pytest, async/await, subprocess, file operations) - Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems - Experience writing tests (functional, integration not just running them) - Familiarity with Docker containers (running evaluations locally in containers) - Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results) - English proficiency at level B2 This opportunity is suitable for experienced developers, software engineers, and test automation specialists who are open to part-time, non-permanent projects. The compensation for this project can go up to $12 per hour equivalent, depending on the level and pace of your contribution. The effort estimate for tasks in this project is approximately 20 hours, depending on complexity. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Please note that compensation may vary across projects based on scope, complexity, and required expertise. Keep in mind that other projects on the platform may offer different earning levels based on their specific requirements. You will be connecting specialists with project-based AI opportunities for leading tech companies through Mindrift. The focus will be on testing, evaluating, and improving AI systems. Participation in projects is on a project-based basis, not as permanent employment. Key Responsibilities: - Review and refine realistic coding tasks based on provided production codebases with realistic scope, requirements, and information sources - Write comprehensive functional tests that validate actual end-to-end behavior and edge-cases, not just superficial checks - Craft "fair but hard" challenges where the AI has all the context it needs, but has to work for it (information scattered across files and external sources, complex reasoning required) - Analyze AI failures to understand what the model struggles with versus what it masters - Iterate based on feedback from expert QA reviewers who score your work on 7 quality criteria Qualifications Required: - Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily in Python (pytest, async/await, subprocess, file operations) - Background in Full-Stack development, with an equal focus on building React-based interfaces and robust Back-end systems - Experience writing tests (functional, integration not just running them) - Familiarity with Docker containers (running evaluations locally in containers) - Understanding of CI/CD (GitHub Actions as a user: triggers, labels, reading results) - English proficiency at level B2 This opportunity is suitable for experienced developers, software engineers, and test automation specialists who are open to part-time, non-permanent projects. The compensation for this project can go up to $12 per hour equivalent, depending on the level and pace of your contribution. The effort estimate for tasks in this project is approximately 20 hours, depending on complexity. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Please note that compensation may vary across projects based on scope, complexity, and required expertise. Keep in mind that other projects on the platform may offer different earning levels based on their specific requirements.

Skills Required

Python Software development Docker FullStack development React Backend systems CICD

Posted on: March 18, 2026

Relevant Jobs

Medical Copywriter

Thepharmadaily

All India

View Job →

QuickTV AI Video and Sound Editor (Contract)

Sharechat

All India

View Job →

Senior Designer- Electrical

Barry-Wehmiller

All India, Chennai

View Job →

Digital and print media artist

Stackular

All India, Hyderabad

View Job →

Director Brand Marketing

Upstox

All India

View Job →

Content and Social Media Marketing Internship

calmveda

All India, Delhi

View Job →

Social Media & Content Lead

FrugalTesting

All India

View Job →

Video Content Creator/Producer (Shoot & Edit)

alt.f coworking

All India, Gurugram

View Job →

Video Editing/Making - Internship

Animtopedia Private Limited

All India, Faridabad

View Job →

Senior Performance Marketer

Get Marketed

All India, Jaipur

View Job →

Freelance AI Evaluation Engineer (Python/Full-Stack) (Hyderabad)

A recruiter messaged CVX24 Premium users few seconds ago.

Enter Your Details

Job Description

Skills Required

Relevant Jobs

Medical Copywriter

QuickTV AI Video and Sound Editor (Contract)

Senior Designer- Electrical

Digital and print media artist

Director Brand Marketing

Content and Social Media Marketing Internship

Social Media & Content Lead

Video Content Creator/Producer (Shoot & Edit)

Video Editing/Making - Internship

Senior Performance Marketer

Application Submitted

Your Professional Info

Login / Register Free