Ready to build speech AI that actually works in real-time?
A well-funded AI startup has developed new model architectures that make real-time conversational AI finally viable at scale. While most voice AI still suffers from delays and computational bottlenecks, they've solved the core efficiency problems that have held the field back.The role
As their Speech Research Scientist, you'll build the speech models that could define the next decade of voice interaction. You'll work on novel architectures that have immediate real-world impact for thousands of customers.What you'll do
- Design and implement SOTA speech synthesis models
- Develop efficient algorithms for voice processing and audio understanding
- Create scalable systems that handle massive audio workloads
- Build comprehensive evaluation methods to validate model performance
- Collaborate with engineering teams to transition research into production
What you'll bring
- Deep expertise in modern speech technologies (Text-to-Speech, Speech LLMs, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Speech Restoration)
- Strong background in generative modeling for audio and speech
- Publications at leading conferences
- Track record of implementing research ideas from concept to production
Location: | Bay Area |
---|---|
Job type: | Permanent |
Emp type: | Full-time |
Salary type: | Annual |
Salary: | negotiable |
Job published: | 05/06/2025 |
Job ID: | 33251 |