Speech ML Engineer

Machine LearningSeniorFull-timeRemote

$180K-$300KPosted 1 months ago

About the Role

ElevenLabs is seeking a Speech ML Engineer to advance the state of the art in voice synthesis and cloning. You will develop neural TTS models, work on voice conversion techniques, and build systems for ultra-realistic speech generation.

Requirements

4+ years of experience in speech/audio ML
Expertise in neural TTS (Tacotron, VITS, etc.)
Strong proficiency in Python and PyTorch
Experience with audio processing and signal analysis
Understanding of speech synthesis evaluation metrics

Nice to Have

Experience with voice cloning or voice conversion
Background in music information retrieval
Experience with codec-based speech models
Multilingual speech synthesis experience

Benefits

Early-stage equity

Health benefits

Remote-first culture

Hardware budget

Conference travel

Flexible PTO

Skills

Speech SynthesisTTSPyTorchAudio MLPythonVoice Cloning

Apply for this Position

Related Jobs

Senior Machine Learning Engineer

OpenAI · San Francisco, CA

$250K-$400KMachine Learning

ML Engineer — FAIR

Meta · Menlo Park, CA

$200K-$350KMachine Learning

ML Engineer — Constitutional AI

Anthropic · San Francisco, CA · Remote

$220K-$370KMachine Learning

ML Engineer — Evaluation

Scale AI · San Francisco, CA

$190K-$300KMachine Learning

Preparing for Your AI Career?

Vincony has all 400+ AI models in one place — compare responses, AI debate, Image/Video/Voice generator, and 20 more tools to help you learn and build with AI.

Visit Vincony.com