Freelance Agent Evaluation Engineer

Mindrift · Warsaw, Masovian Voivodeship, Poland · Remote possible

Company
Mindrift
Location
Warsaw, Masovian Voivodeship, Poland
Employment type
Part-time
Posted on
20 May 2026

About this job

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves

We're building a dataset to evaluate AI coding agents: how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:

Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history
Assemble and calibrate tasks from intermediate states of the virtual company: cra

This is a short summary. The full description is available on the employer's page.
