Freelance Agent Evaluation Engineer

Mindrift · Belgium · Remote-friendly


Company: Mindrift
Location: Belgium
Employment type: Part-time
Posted: May 31, 2026

About this job

Please submit your CV in English and indicate your level of English proficiency.

Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.

What this opportunity involves

We're building a dataset to evaluate AI coding agents: how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments:

- Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history
- Assemble and calibrate tasks from intermediate states of the virtual company: cra

This is a short summary. The full description is on the employer’s page.
