Job title: Lead Applied ML Scientist - TTS
Job type: Permanent
Emp type: Full-time
Industry: Generative AI
Functional Expertise: Deep Learning, Gen-Speech/TTS, Machine Learning, SOTA Speech
Salary type: Annual
Salary: negotiable
Location: Bay Area
Job published: 14/02/2024
Job ID: 32241

Job Description

The future of communication should be bias- and barrier-free. That's the vision behind this well-funded start-up pioneering real-time speech algorithms.

 

You’ll join a research team working on first-of-its-kind technology that improves how we communicate in the real world by offering clear, natural-sounding conversations regardless of accent or environment. Their ground-breaking technology is already delivering impressive results, so it’s no wonder they’re growing 4x annually.

 

They are creating the whitespace in speech research, and you’ll play a key role.

 

As a Lead Applied ML Scientist, you’ll manage a small but mighty R&D team advancing core speech algorithms and audio AI models.

 

The Role

 

  • Lead cutting-edge R&D advancing core speech algorithms and generative audio models, continually pushing the boundaries of what is possible.
  • Tackle unsolved problems in Generative Speech and Audio such as preserving naturalness and performance in noisy environments.
  • Take end-to-end ownership of models, from data collection to training in the cloud.
  • Develop novel architectures that balance cutting-edge performance with real-time efficiency and low latency.
  • Lead a team of top scientists in this field.

 

You'll Have

 

  • 4+ years of industry experience developing and implementing one or more of the following: Text-to-Speech (TTS), Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Accent Translation, Speech Restoration
  • Proven background contributing to well-known research publications and/or products in these areas
  • Degree in Computer Science, ML, or related field. PhD is nice to have
  • Experience managing a small team (resource management, performance reviews, etc.)
  • Proven experience with PyTorch, TensorFlow and modern DL techniques such as GANs, VAEs, diffusion or flow models, etc.
  • Familiar with cloud-based technologies and production environments

 

 

What you’ll get in return

 

  • A negotiable and genuinely open package for the right candidate. We're looking for the best people who want to work in an environment of freedom and autonomy.
  • Benefits include a competitive salary ($500k+ depending on experience). You'll also get share options, unlimited PTO, health coverage, and a VPO plan.
  • Contributing to whitespace in speech technology research, you'll have control over the direction of your work with no friction whatsoever. You’re the expert after all.

 

If you’re looking to make an impact, there are few better places to do it. Your work here has the power to improve communication, eliminate confusion, and create a more connected world.

 

If you want the freedom to shape the future of speech AI, apply now.