Your search has found 7 jobs

Looking to tackle novel speech challenges at scale?

You'll be joining a small but mighty speech AI company building proprietary speech tech from the ground up. With a strong customer base, your research will directly impact production systems serving enterprise customers, with the opportunity to see your work deployed at scale in real-world voice applications.

They're a well-funded startup with healthy revenue streams and immediate opportunities for high-impact research.

Your research

You'll be working on breakthrough speech research that push the boundaries of naturalness and real-time performance. The company has achieved ultra-low latency and is now advancing toward unified speech-to-speech architectures.

You'll develop emotional expression and natural speech generation, advance multilingual support across 30+ languages, and enhance voice cloning robustness.

Your focus

  • Lead cutting-edge research in SOTA speech models (TTS, ASR, or speech-to-speech)
  • Design, execute and iterate on experiments end-to-end
  • Drive speech controllability and naturalness improvements
  • Develop evaluation methodologies for speech quality assessment

What you'll bring

  • Deep understanding of cutting-edge speech models with end-to-end pipeline experience
  • Experience with large-scale model training
  • Strong background in speech model development and optimisation
  • Published work with demonstrable results in industry or academic settings

Nice to have

  • Performance optimisation experience for latency and compute efficiency
  • Experience with model fusion and unified architectures

This is a remote role, either in US or Europe. Competitive comp based on experience.

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 23/09/2025
Job ID: 33913

Ready to pioneer the speech intelligence behind the next generation of embodied AI?

Join a pioneering startup developing foundational technology for natural conversation in embodied agents. You'll advance the speech systems that power avatars with authentic behaviours, real-time expression, and conversational intelligence that handles interruptions and turn-taking just like humans.

This Lead Research Scientist role focuses on advancing real-time speech systems for interactive avatars. You'll develop full-duplex dialogue models and speech-to-speech architectures that enable natural conversational flow, interruption handling, and emotional expression.

Founded by ex-Googlers, they're building proprietary behaviour models that learn from two-way interactions, creating systems where speech timing, prosody, and contextual responses work in harmony with facial expressions and physical behaviours to drive authentic embodied intelligence.

Your focus:

  • Research & develop full-duplex speech systems with natural interruption handling
  • Develop expressive voice models with controllable prosody and timing
  • Build speech-to-speech architectures preserving identity and emotion
  • Create real-time audio generation systems for conversational avatars
  • Publish research while deploying systems in production
  • Collaborate across teams integrating speech with visual behaviour

Requirements:

  • PhD in Speech, Machine Learning, or related field
  • First-author publications at top conferences (Interspeech, ICASSP, NeurIPS, ICLR, etc)
  • Expertise in text-to-speech, speech-to-speech models, or voice cloning
  • Large-scale training experience
  • Experience in prosody modelling or real-time audio generation

Nice to have:

  • Experience with full-duplex speech research
  • Speech-visual alignment expertise (lip sync, expressions)
  • Real-time audio deployment optimisation

Package:

  • Competitive salary $200k- $300k base (based on experience)
  • Meaningful equity package
  • Comprehensive healthcare (90% covered)
  • Unlimited PTO
  • Fully remote work with regular team offsites
  • Life insurance and disability coverage

Location: Fully remote position, globally, with preference for Pacific Time alignment.

Ready to make AI conversations feel authentically human?

Contact Allys at Techire AI. All applicants will receive a response.

 
Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 02/07/2025
Job ID: 33482

Ready to build speech AI that actually works in real-time?

A well-funded AI startup has developed new model architectures that make real-time conversational AI finally viable at scale. While most voice AI still suffers from delays and computational bottlenecks, they've solved the core efficiency problems that have held the field back. 

The role

As their Speech Research Scientist, you'll build the speech models that could define the next decade of voice interaction. You'll work on novel architectures that have immediate real-world impact for thousands of customers.

What you'll do

  • Design and implement SOTA speech synthesis models
  • Develop efficient algorithms for voice processing and audio understanding
  • Create scalable systems that handle massive audio workloads
  • Build comprehensive evaluation methods to validate model performance
  • Collaborate with engineering teams to transition research into production

What you'll bring

  • Deep expertise in modern speech technologies (Text-to-Speech, Speech LLMs, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Speech Restoration)
  • Strong background in generative modeling for audio and speech
  • Publications at leading conferences
  • Track record of implementing research ideas from concept to production
This role is based in the Bay Area.
 
If you're excited about building the foundational models that will power the Voice AI revolution, we'd love to hear from you.
 
Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 05/06/2025
Job ID: 33251

Do you want to create emotionally expressive AI that transforms healthcare conversations?

A pioneering healthtech unicorn is building AI digital health agents designed to safely and empathetically assist patients. Their immediate focus is developing conversational AI with genuine emotional intelligence, with longer-term vision for full-duplex communication capabilities.

As the Staff Research Scientist, you'll play a key part in making this a reality - building foundational speech models that understand and respond with human-like emotion and natural conversation that healthcare demands.

What you'll do

  • Design and develop emotionally expressive speech models for healthcare conversations, working end-to-end from research through to productionizing models
  • Build conversational AI systems that can interpret and respond with appropriate emotional intelligence
  • Work on post-training techniques to enhance speech models' conversational and emotional capabilities
  • Tackle unique challenges including response time optimization, maintaining emotional consistency, and operating in noisy healthcare environments
  • Have the opportunity to publish your groundbreaking research

What you'll bring

  • 5+ years in speech technologies or related field
  • Hands-on experience with speech-to-speech systems (highly preferred), or strong experience in Text-to-Speech, Speech LLMs, emotional/expressive speech synthesis, or similar
  • Experience training large speech datasets
  • Ability to implement research papers from scratch

Bonus points for

  • Experience pre-training foundation models with speech (HuBERT, Wav2Vec, or similar)
  • Multimodal experience
  • Experience with inference technologies (vLLM, CUDA)

You'll be based in the Bay Area or willing to relocate. You'll receive highly competitive comp (up to $350K base DOE) with substantial equity.

If you're excited about creating the next generation of emotionally intelligent speech AI that will revolutionise healthcare communication, click apply!

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 16/04/2025
Job ID: 33086

The future of communication should be bias and barrier-free. That's the vision behind this well-funded start-up pioneering real-time speech algorithms.
 
You'll join a research team on tech that is the first of its kind, improving how we communicate in the real world. By offering clear, natural-sounding conversations regardless of accent or environment. Their ground-breaking technology is already providing impressive results, so it's no wonder they're growing x4 annually.
 
They are creating the whitespace in speech research, and you'll play a key role.
 
As an Senior ML Scientist, you'll work within a talented R&D team advancing core speech algorithms and audio AI models.
 
The role
 
- Contribute to cutting-edge R&D advancing core speech algorithms and Generative Audio models. Continually push boundaries to the next level.
- Tackle unsolved problems in Generative Speech and Audio such as preserving naturalness and performance in noisy environments.
- End-to-end ownership of models, from data collection to training on the cloud.
- Develop novel architectures balancing cutting-edge performance with real-time efficiency & low-latency
- Collaborate with top scientists in this field 
 
You'll have
 
- 4+ years of industry experience developing and implementing either of the following: TTS, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Accent Translation, Speech Restoration
- Proven background contributing to well-known research publications and/or products in these areas
- PhD or degree in Computer Science, ML, or related field. 
- Proven experience with PyTorch, TensorFlow and modern DL techniques such as GANs, VAEs, diffusion or flow models, etc.
- Familiar with cloud-based technologies and production environments
 
What you'll get in return
 
- Benefits include a competitive salary, share options, unlimited PTO health coverage, and a VPO plan.
- Contributing to whitespace in speech technology research, you'll have control over the direction of your work with no friction whatsoever. You're the expert after all.
 
If you're looking to make an impact, there are few better places to do it. Your work here has the power to improve communication, eliminate confusion, and create a more connected world.
 
If you want the freedom to shape the future of speech AI, apply now.

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: USD $300,000.00
Job published: 26/02/2025
Job ID: 32723

Pioneer the future of visual AI creation

Looking to shape how millions experience generative visual content? A fast-growing AI startup seeks you, a vision generation researcher to advance their agentic approach to image and video synthesis technology.

This innovative team has already launched a successful product in the visual creativity space and is now expanding their research capabilities to drive the next phase of development.

Your expertise The ideal candidate brings research depth in at least one key area:

  • Advanced image synthesis - Creating high-fidelity visuals, exploring 3D neural rendering, or developing large vision models
  • Next-generation video AI - Building systems for fluid motion generation, temporal consistency, or efficient video creation pipelines
  • Cross-modal integration - Connecting visual generation with text, audio, or other input modalities

You thrive on implementing and improving state-of-the-art techniques, whether working with diffusion models, adversarial networks, or transformer-based architectures. Your technical foundation is matched by your creativity in pushing these technologies to new horizons.

Your impact You'll conduct foundational research that directly shapes product development, bridging the gap between academic innovation and real-world applications. Your work will enhance a platform already reaching a substantial user base, influencing how people interact with AI-generated visual content.

Your environment You'll collaborate with a close-knit, passionate team that values both technical excellence and creative vision.

Your package

  • Competitive salary: $200,000-$225,000 (negotiable based on experience)
  • Equity package
  • Comprehensive healthcare benefits
  • Additional perks and flexible work arrangements

Based in New York City with a hybrid work model, though remote arrangements will be considered for exceptional candidates.

Ready to transform the landscape of creative visual AI? Contact Marc Powell at techire.ai for a confidential discussion or submit your application today.

Location: New York, NY
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 26/02/2025
Job ID: 32690

A fast-growing AI audio startup is looking for a Lead Researcher to spearhead their next breakthrough: a text-to-music platform with advanced vocal synthesis capabilities. With over 5M active users and backing from music industry legends, they've proven their ability to deliver impactful AI audio tools at scale while generating substantial revenue.

Building on their commitment to ethically sourced and licensed data, this is a unique chance to lead the technical direction of a new proprietary platform from the ground up. You'll join a small but experienced team where you'll have true ownership and autonomy, with millions of existing users ready to adopt your innovations.

Core focus:

  • Lead the development of large-scale generative text-to-music models with advanced vocal synthesis
  • Pioneer novel approaches in AI singing and voice synthesis
  • Drive technical strategy and research direction
  • Build and lead the research team as it grows

Requirements:

  • Experience with large diffusion or autoregressive generative music model training
  • Experience with SOTA music generation techniques
  • Experience with vocal synthesis is a plus!

The role offers:

  • Comp DOE €130-€200k plus equity
  • Fully remote (overlap with 9am-2pm PST required)
  • Direct collaboration with the Co-Founders

If you're excited about building ethical, industry-changing technology with immediate real-world impact, we'd love to hear from you.

Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 07/01/2025
Job ID: 32291