Freelance Agent Evaluation Engineer

Mindrift · Warsaw, Masovian Voivodeship, Poland · Remote possible

Company
Mindrift
Location
Warsaw, Masovian Voivodeship, Poland
Contract type
Part-time
Posted on
20 May 2026

About this role

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves

We're building a dataset to evaluate AI coding agents: how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:

- Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history
- Assemble and calibrate tasks from intermediate states of the virtual company: cra

This is a short summary. The full description is available on the employer's page.
