Your search has found 2 jobs

Ready to build speech AI that actually works in real-time?

A well-funded AI startup has developed new model architectures that make real-time conversational AI finally viable at scale. While most voice AI still suffers from delays and computational bottlenecks, they've solved the core efficiency problems that have held the field back. 

The role

As their Speech Research Scientist, you'll build the speech models that could define the next decade of voice interaction. You'll work on novel architectures that have immediate real-world impact for thousands of customers.

What you'll do

  • Design and implement SOTA speech synthesis models
  • Develop efficient algorithms for voice processing and audio understanding
  • Create scalable systems that handle massive audio workloads
  • Build comprehensive evaluation methods to validate model performance
  • Collaborate with engineering teams to transition research into production

What you'll bring

  • Deep expertise in modern speech technologies (Text-to-Speech, Speech LLMs, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Speech Restoration)
  • Strong background in generative modeling for audio and speech
  • Publications at leading conferences
  • Track record of implementing research ideas from concept to production
This role is based in the Bay Area.
 
If you're excited about building the foundational models that will power the Voice AI revolution, we'd love to hear from you.
 
Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 05/06/2025
Job ID: 33251

A fast-growing AI audio startup is looking for a Lead Researcher to spearhead their next breakthrough: a text-to-music platform with advanced vocal synthesis capabilities. With over 5M active users and backing from music industry legends, they've proven their ability to deliver impactful AI audio tools at scale while generating substantial revenue.

Building on their commitment to ethically sourced and licensed data, this is a unique chance to lead the technical direction of a new proprietary platform from the ground up. You'll join a small but experienced team where you'll have true ownership and autonomy, with millions of existing users ready to adopt your innovations.

Core focus:

  • Lead the development of large-scale generative text-to-music models with advanced vocal synthesis
  • Pioneer novel approaches in AI singing and voice synthesis
  • Drive technical strategy and research direction
  • Build and lead the research team as it grows

Requirements:

  • Experience with large diffusion or autoregressive generative music model training
  • Experience with SOTA music generation techniques
  • Experience with vocal synthesis is a plus!

The role offers:

  • Comp DOE €130-€200k plus equity
  • Fully remote (overlap with 9am-2pm PST required)
  • Direct collaboration with the Co-Founders

If you're excited about building ethical, industry-changing technology with immediate real-world impact, we'd love to hear from you.

Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 07/01/2025
Job ID: 32291