Your search has found 2 jobs

Ready to drive the next breakthrough in speech AI?

This rapidly growing, well-funded tech startup is redefining speech technology with industry-leading accuracy and latency. Their voice AI platform is already outperforming established competitors while supporting languages traditionally overlooked by mainstream technologies.

With a strong client base and significant investment secured, they're now at an exciting inflection point - focusing on building proprietary speech AI systems from the ground up. This presents a rare opportunity to shape the future of multilingual speech technology.

The role

As Lead Speech Research Engineer, you'll direct the technical vision for their next phase of innovation. You'll have the freedom to make foundational architectural decisions and establish new research directions.

You'll work at the intersection of cutting-edge research and practical implementation, with the autonomy to build solutions that push the boundaries of what's possible in speech AI.

What makes this opportunity special

  • You'll define research strategy rather than implementing someone else's vision
  • Build something entirely new with technical freedom
  • Directly influence product direction and company growth
  • See your innovations impact real-world applications
  • Opportunities to attend conferences and publish research in the future

What you'll do

  • Shape the research vision and technical architecture for next-generation speech models
  • Develop E2E speech models with proprietary solutions that push performance boundaries
  • Guide a growing R&D team on novel approaches and architectural innovations
  • Create systems that raise the bar for accuracy, latency, and multilingual performance
  • Establish best practices for data preparation and model optimization

What you'll bring

  • 8+ years of experience developing ML systems for audio applications
  • Proven success building speech models end-to-end, not just fine-tuning existing ones
  • Experience with large-scale model training and optimization
  • 1st author publications at top-tier speech/audio conferences 
  • Deep understanding of modern speech processing techniques and architectures

You'll get

  • To work remotely (in EU or US Eastern timezone) or hybrid in Europe. You'll also get to meet periodically with for team gatherings in Europe.
  • Salary up to €200K DOE
  • Unlimited PTO and comprehensive health coverage
  • Generous pension plan and lifestyle benefits

If you're passionate about pushing the boundaries of speech AI and want the freedom to build something revolutionary, this is your opportunity to make a significant impact. Get in touch today!

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 11/04/2025
Job ID: 33053

The future of communication should be bias and barrier-free. That's the vision behind this well-funded start-up pioneering real-time speech algorithms.
 
You'll join a research team on tech that is the first of its kind, improving how we communicate in the real world. By offering clear, natural-sounding conversations regardless of accent or environment. Their ground-breaking technology is already providing impressive results, so it's no wonder they're growing x4 annually.
 
They are creating the whitespace in speech research, and you'll play a key role.
 
As an Senior ML Scientist, you'll work within a talented R&D team advancing core speech algorithms and audio AI models.
 
The role
 
- Contribute to cutting-edge R&D advancing core speech algorithms and Generative Audio models. Continually push boundaries to the next level.
- Tackle unsolved problems in Generative Speech and Audio such as preserving naturalness and performance in noisy environments.
- End-to-end ownership of models, from data collection to training on the cloud.
- Develop novel architectures balancing cutting-edge performance with real-time efficiency & low-latency
- Collaborate with top scientists in this field 
 
You'll have
 
- 4+ years of industry experience developing and implementing either of the following: TTS, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Accent Translation, Speech Restoration
- Proven background contributing to well-known research publications and/or products in these areas
- PhD or degree in Computer Science, ML, or related field. 
- Proven experience with PyTorch, TensorFlow and modern DL techniques such as GANs, VAEs, diffusion or flow models, etc.
- Familiar with cloud-based technologies and production environments
 
What you'll get in return
 
- Benefits include a competitive salary, share options, unlimited PTO health coverage, and a VPO plan.
- Contributing to whitespace in speech technology research, you'll have control over the direction of your work with no friction whatsoever. You're the expert after all.
 
If you're looking to make an impact, there are few better places to do it. Your work here has the power to improve communication, eliminate confusion, and create a more connected world.
 
If you want the freedom to shape the future of speech AI, apply now.

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: USD $300,000.00
Job published: 26/02/2025
Job ID: 32723