Senior Data Architect

Omilia · Poland · Remote méiglech

Dir gitt op déi original Annonce vum Patron weidergeleet.

Firma
Omilia
Plaz
Poland
Aart vum Kontrakt
Vollzäit
Publizéiert den
15. Abrëll 2026

Iwwer dës Plaz

Accountabilities Own the Training Environment data architecture end-to-end: dataset design and schema for all ML training pipelines, including dialog corpora for LLM training, conversational steps for NLU models, annotated evaluation sets, and whole-call recordings for speech-to-speech model development. Define and govern data selection and sampling strategy: establish criteria that determine which production conversations have the highest training value, including diversity-optimized sampling, confidence-based filtering, edge-case prioritization, and deduplication strategies. Build and maintain the data catalog and dataset discovery infrastructure: enable ML engineers across LLM, NLU, Speech, and Agentic teams to find, understand, and use training data without friction. Define annotation

Dëst ass eng kuerz Zesummefaassung. Déi voll Beschreiwung fannt Dir op der Säit vum Patron.

Kritt Joben wéi dësen

Maacht e gratis Profil a kritt passend Plazen aus Poland an der ganzer EU.

Meng Matches kucken