Freelance Agent Evaluation Engineer
Mindrift · Rome, Metropolitan City of Rome Capital, Italy · Remote-friendly
- Company: Mindrift
- Location: Rome, Metropolitan City of Rome Capital, Italy
- Employment type: Part-time
- Salary: $40
- Posted: June 23, 2026
About this job
Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
What this opportunity involves
We're building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:
- Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history
- Design tasks from intermediate states of these environments - craft the prompt, define what "so…