Senior Data Architect

Omilia · Poland · Remote work possible

Company
Omilia
Location
Poland
Contract type
Full-time
Posted on
April 15, 2026

About this role

Accountabilities

Own the Training Environment data architecture end-to-end: dataset design and schema for all ML training pipelines, including dialog corpora for LLM training, conversational steps for NLU models, annotated evaluation sets, and whole-call recordings for speech-to-speech model development.

Define and govern data selection and sampling strategy: establish criteria that determine which production conversations have the highest training value, including diversity-optimized sampling, confidence-based filtering, edge-case prioritization, and deduplication strategies.

Build and maintain the data catalog and dataset discovery infrastructure: enable ML engineers across LLM, NLU, Speech, and Agentic teams to find, understand, and use training data without friction.

Define annotation

This is a short summary. The full description is available on the employer's page.
