Freelance Agent Evaluation Engineer

Mindrift · Warsaw, Masovian Voivodeship, Poland · Remote méiglech

Dir gitt op déi original Annonce vum Patron weidergeleet.

Firma
Mindrift
Plaz
Warsaw, Masovian Voivodeship, Poland
Aart vum Kontrakt
Deelzäit
Publizéiert den
20. Mee 2026

Iwwer dës Plaz

Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history Assemble and calibrate tasks from intermediate states of the virtual company: cra

Dëst ass eng kuerz Zesummefaassung. Déi voll Beschreiwung fannt Dir op der Säit vum Patron.

Kritt Joben wéi dësen

Maacht e gratis Profil a kritt passend Plazen aus Poland an der ganzer EU.

Meng Matches kucken