Your search has found 3 jobs

Looking to define ASR strategy for the next generation of social AI?

You'll be joining a well-funded social AI company building lifelike AI characters that interact naturally across voice, video, and text. Founded by a prominent tech entrepreneur, they're creating new media formats for AI-driven interaction where agents handle group conversations, interruptions, and multi-agent dynamics.

Your mission

You'll own the ASR function from day one - starting with evaluating and implementing existing solutions, then moving toward building proprietary models as the platform scales. This means hands-on work testing APIs and open-source models, followed by developing custom systems for multi-agent group conversations and social interactions.

You'll shape the technical direction, balance short-term delivery with long-term innovation, and drive individual research initiatives while collaborating on broader team objectives.

Your focus

  • Define and execute the ASR roadmap from evaluation through production deployment
  • Build and train models that handle natural conversation dynamics
  • Develop evaluation systems to measure accuracy, speed, and reliability
  • Define data requirements and create pipelines for ASR training
  • Work from low-level performance optimizations to high-level architecture decisions

What you'll bring

  • Proven track record building and deploying ASR systems at scale
  • Strong familiarity with SOTA ASR models and architectures (Whisper, Conformer, etc.)
  • Understanding of data quality assessment for speech systems

Nice to have

  • Experience leading technical initiatives or ML teams

Remote with competitive comp + stock.

Ready to define the future of social AI interactions? Apply today.

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 05/12/2025
Job ID: 34546

Looking to tackle novel speech challenges at scale?

You'll be joining a small but mighty speech AI company building proprietary speech tech from the ground up. With a strong customer base, your research will directly impact production systems serving enterprise customers, with the opportunity to see your work deployed at scale in real-world voice applications.

They're a well-funded startup with healthy revenue streams and immediate opportunities for high-impact research.

Your research

You'll be working on breakthrough speech research that push the boundaries of naturalness and real-time performance. The company has achieved ultra-low latency and is now advancing toward unified speech-to-speech architectures.

You'll develop emotional expression and natural speech generation, advance multilingual support across 30+ languages, and enhance voice cloning robustness.

Your focus

  • Lead cutting-edge research in SOTA speech models (TTS, ASR, or speech-to-speech)
  • Design, execute and iterate on experiments end-to-end
  • Drive speech controllability and naturalness improvements
  • Develop evaluation methodologies for speech quality assessment

What you'll bring

  • Deep understanding of cutting-edge speech models with end-to-end pipeline experience
  • Experience with large-scale model training
  • Strong background in speech model development and optimisation
  • Published work with demonstrable results in industry or academic settings

Nice to have

  • Performance optimisation experience for latency and compute efficiency
  • Experience with model fusion and unified architectures

This is a remote role, either in US or Europe. Competitive comp based on experience.

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 23/09/2025
Job ID: 33913

Looking to solve complex ASR challenges at scale?

You'll be joining an established conversational AI company with proprietary in-house speech models that process billions of interactions annually, where your ASR expertise will directly impact real-world customer experiences.

You'll be tackling demanding ASR problems in production environments: streaming speech recognition in noisy conditions, robust accent handling and maintaining high performance at scale.

Working across key production environments, you'll enhance speech capabilities and bring new features to production that push the boundaries of what's possible in challenging acoustic environments.

 

Your focus

  • Maintain and iteratively improve existing ASR technology while introducing cutting-edge enhancements
  • Work end-to-end across speech processing components: speech enhancement, VAD, diarisation, and ASR (AM/LM modelling, ASR biasing)
  • Build streaming ASR systems optimised for challenging acoustic environments
  • Implement emotion detection and acoustic condition classification capabilities
  • Run extensive experiments to advance activity detection and speech processing performance

 

What you'll bring

  • Strong background in ASR model development and deployment
  • Hands-on experience with SOTA speech toolkits (Kaldi, K2, NVIDIA NeMo, Parakeet)
  • Proven streaming ASR experience in production environments

 

This is a fully remote role – must be close to EU timezone. 

Ready to make your mark on speech tech that millions rely on daily? Apply today.

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 12/08/2025
Job ID: 33647