Senior Data Architect
Omilia · Poland · Télétravail possible
Vous continuerez vers l’annonce originale de l’employeur.
- Entreprise
- Omilia
- Lieu
- Poland
- Type de contrat
- Temps plein
- Publiée le
- 15 avril 2026
À propos de ce poste
Accountabilities Own the Training Environment data architecture end-to-end: dataset design and schema for all ML training pipelines, including dialog corpora for LLM training, conversational steps for NLU models, annotated evaluation sets, and whole-call recordings for speech-to-speech model development. Define and govern data selection and sampling strategy: establish criteria that determine which production conversations have the highest training value, including diversity-optimized sampling, confidence-based filtering, edge-case prioritization, and deduplication strategies. Build and maintain the data catalog and dataset discovery infrastructure: enable ML engineers across LLM, NLU, Speech, and Agentic teams to find, understand, and use training data without friction. Define annotation …
Ceci est un résumé court. La description complète se trouve sur la page de l’employeur.
Recevez des offres comme celle-ci
Créez un profil gratuit et recevez des offres de Poland et de toute l’UE correspondant à vos compétences.
Voir mes correspondances