Freelance Agent Evaluation Engineer

Mindrift · Rome, Metropolitan City of Rome Capital, Italy · Remote-friendly

You will continue to the employer’s original posting.

Company
Mindrift
Location
Rome, Metropolitan City of Rome Capital, Italy
Employment type
Part-time
Salary
$40
Posted
June 23, 2026

About this job

Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history Design tasks from intermediate states of these environments - craft the prompt, define what "so

This is a short summary. Want to know if you're a fit? Check your CV against this role — free, in 30 seconds.

See if your CV fits this job

Paste your CV for an instant match score against this role — and get a tailored cover letter in one click.

  • Instant match score for this role
  • Tailored cover letter in one click
  • Free — no credit card
Check my CV — free
Freelance Agent Evaluation Engineer — Mindrift | NewLuxJob | NewLuxJob