Freelance Agent Evaluation Engineer
Mindrift · Warsaw, Masovian Voivodeship, Poland · Remote méiglech
Dir gitt op déi original Annonce vum Patron weidergeleet.
- Firma
- Mindrift
- Plaz
- Warsaw, Masovian Voivodeship, Poland
- Aart vum Kontrakt
- Deelzäit
- Publizéiert den
- 20. Mee 2026
Iwwer dës Plaz
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history Assemble and calibrate tasks from intermediate states of the virtual company: cra…
Dëst ass eng kuerz Zesummefaassung. Déi voll Beschreiwung fannt Dir op der Säit vum Patron.
Kritt Joben wéi dësen
Maacht e gratis Profil a kritt passend Plazen aus Poland an der ganzer EU.
Meng Matches kucken