Freelance Agent Evaluation Engineer

Mindrift · Rome, Metropolitan City of Rome Capital, Italy · Remote méiglech

Dir gitt op déi original Annonce vum Patron weidergeleet.

Firma
Mindrift
Plaz
Rome, Metropolitan City of Rome Capital, Italy
Aart vum Kontrakt
Deelzäit
Pai
$40
Publizéiert den
23. Juni 2026

Iwwer dës Plaz

Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents - how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: Build realistic developer environments - a virtual company with codebase, infrastructure, and context (tickets, docs, conversations) that forms a believable development history Design tasks from intermediate states of these environments - craft the prompt, define what "so

Dat ass eng kuerz Resumé. Wëllt Dir wëssen ob Dir passt? Préift Äre CV fir dës Plaz — gratis, an 30 Sekonnen.

Passt Äre CV zu dëser Plaz?

Kopéiert Äre CV eran fir direkt e Match-Score fir dës Plaz ze kréien — plus e personaliséierte Motivatiounsbréif mat engem Klick.

  • Direkten Match-Score fir dës Plaz
  • Personaliséierte Motivatiounsbréif mat engem Klick
  • Gratis — keng Kreditkaart
Mäi CV préiwen — gratis
Freelance Agent Evaluation Engineer — Mindrift | NewLuxJob | NewLuxJob