Your search has found 2 jobs

Looking to tackle novel speech challenges at scale?

You'll be joining a small but mighty speech AI company building proprietary speech tech from the ground up. With a strong customer base, your research will directly impact production systems serving enterprise customers, with the opportunity to see your work deployed at scale in real-world voice applications.

They're a well-funded startup with healthy revenue streams and immediate opportunities for high-impact research.

Your research

You'll be working on breakthrough speech research that push the boundaries of naturalness and real-time performance. The company has achieved ultra-low latency and is now advancing toward unified speech-to-speech architectures.

You'll develop emotional expression and natural speech generation, advance multilingual support across 30+ languages, and enhance voice cloning robustness.

Your focus

  • Lead cutting-edge research in SOTA speech models (TTS, ASR, or speech-to-speech)
  • Design, execute and iterate on experiments end-to-end
  • Drive speech controllability and naturalness improvements
  • Develop evaluation methodologies for speech quality assessment

What you'll bring

  • Deep understanding of cutting-edge speech models with end-to-end pipeline experience
  • Experience with large-scale model training
  • Strong background in speech model development and optimisation
  • Published work with demonstrable results in industry or academic settings

Nice to have

  • Performance optimisation experience for latency and compute efficiency
  • Experience with model fusion and unified architectures

This is a remote role, either in US or Europe. Competitive comp based on experience.

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 23/09/2025
Job ID: 33913

Looking to solve complex ASR challenges at scale?

You'll be joining an established conversational AI company with proprietary in-house speech models that process billions of interactions annually, where your ASR expertise will directly impact real-world customer experiences.

You'll be tackling demanding ASR problems in production environments: streaming speech recognition in noisy conditions, robust accent handling and maintaining high performance at scale.

Working across key production environments, you'll enhance speech capabilities and bring new features to production that push the boundaries of what's possible in challenging acoustic environments.

 

Your focus

  • Maintain and iteratively improve existing ASR technology while introducing cutting-edge enhancements
  • Work end-to-end across speech processing components: speech enhancement, VAD, diarisation, and ASR (AM/LM modelling, ASR biasing)
  • Build streaming ASR systems optimised for challenging acoustic environments
  • Implement emotion detection and acoustic condition classification capabilities
  • Run extensive experiments to advance activity detection and speech processing performance

 

What you'll bring

  • Strong background in ASR model development and deployment
  • Hands-on experience with SOTA speech toolkits (Kaldi, K2, NVIDIA NeMo, Parakeet)
  • Proven streaming ASR experience in production environments

 

This is a fully remote role – must be close to EU timezone. 

Ready to make your mark on speech tech that millions rely on daily? Apply today.

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 12/08/2025
Job ID: 33647