Your search has found 6 jobs

Do you want to create AI that converses as naturally as humans do? A pioneering healthtech unicorn is building AI digital health agents designed to safely and empathetically assist patients.

As the Staff Research Scientist, you'll play a key part in making this a reality - building end-to-end foundational speech models capable of full-duplex communication. This isn't just about taking turns speaking; it's about creating AI that can listen and respond simultaneously with human-like conversation, emotions, and natural language that healthcare demands.

What you'll do

  • Design and develop novel speech foundation models for healthcare conversations, working end-to-end from research through to productionizing models
  • Work on post-training LLMs for speech to enhance their conversational capabilities
  • Tackle unique challenges including response time optimization, maintaining alignment between text and speech outputs, and operating in noisy environments
  • Create innovative approaches to synthetic conversational data generation
  • Have the opportunity to publish your groundbreaking research

What you'll bring

  • PhD with 8+ years in speech technologies or related field
  • Experience with Speech LLMs
  • Experience training large datasets
  • Strong publication record at top-tier conferences in speech/multimodal AI
  • Ability to implement research papers from scratch

Bonus points for

  • Experience pre-training foundation models with speech (HuBERT, Wav2Vec, or similar)
  • Multimodal experience
  • Experience with inference technologies (vLLM, CUDA)

You'll be based in the Bay Area and will receive highly competitive comp (up to $350K base DOE) with substantial equity.

If you're excited about creating the next generation of speech AI that will revolutionize healthcare communication, click apply!

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 16/04/2025
Job ID: 33086

Ready to drive the next breakthrough in speech AI?

This rapidly growing, well-funded tech startup is redefining speech technology with industry-leading accuracy and latency. Their voice AI platform is already outperforming established competitors while supporting languages traditionally overlooked by mainstream technologies.

With a strong client base and significant investment secured, they're now at an exciting inflection point - focusing on building proprietary speech AI systems from the ground up. This presents a rare opportunity to shape the future of multilingual speech technology.

The role

As Lead Speech Research Engineer, you'll direct the technical vision for their next phase of innovation. You'll have the freedom to make foundational architectural decisions and establish new research directions.

You'll work at the intersection of cutting-edge research and practical implementation, with the autonomy to build solutions that push the boundaries of what's possible in speech AI.

What makes this opportunity special

  • You'll define research strategy rather than implementing someone else's vision
  • Build something entirely new with technical freedom
  • Directly influence product direction and company growth
  • See your innovations impact real-world applications
  • Opportunities to attend conferences and publish research in the future

What you'll do

  • Shape the research vision and technical architecture for next-generation speech models
  • Develop E2E speech models with proprietary solutions that push performance boundaries
  • Guide a growing R&D team on novel approaches and architectural innovations
  • Create systems that raise the bar for accuracy, latency, and multilingual performance
  • Establish best practices for data preparation and model optimization

What you'll bring

  • 8+ years of experience developing ML systems for audio applications
  • Proven success building speech models end-to-end, not just fine-tuning existing ones
  • Experience with large-scale model training and optimization
  • 1st author publications at top-tier speech/audio conferences 
  • Deep understanding of modern speech processing techniques and architectures

You'll get

  • To work remotely (in EU or US Eastern timezone) or hybrid in Europe. You'll also get to meet periodically with for team gatherings in Europe.
  • Salary up to €200K DOE
  • Unlimited PTO and comprehensive health coverage
  • Generous pension plan and lifestyle benefits

If you're passionate about pushing the boundaries of speech AI and want the freedom to build something revolutionary, this is your opportunity to make a significant impact. Get in touch today!

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 11/04/2025
Job ID: 33053

The future of communication should be bias and barrier-free. That's the vision behind this well-funded start-up pioneering real-time speech algorithms.
 
You'll join a research team on tech that is the first of its kind, improving how we communicate in the real world. By offering clear, natural-sounding conversations regardless of accent or environment. Their ground-breaking technology is already providing impressive results, so it's no wonder they're growing x4 annually.
 
They are creating the whitespace in speech research, and you'll play a key role.
 
As an Senior ML Scientist, you'll work within a talented R&D team advancing core speech algorithms and audio AI models.
 
The role
 
- Contribute to cutting-edge R&D advancing core speech algorithms and Generative Audio models. Continually push boundaries to the next level.
- Tackle unsolved problems in Generative Speech and Audio such as preserving naturalness and performance in noisy environments.
- End-to-end ownership of models, from data collection to training on the cloud.
- Develop novel architectures balancing cutting-edge performance with real-time efficiency & low-latency
- Collaborate with top scientists in this field 
 
You'll have
 
- 4+ years of industry experience developing and implementing either of the following: TTS, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Accent Translation, Speech Restoration
- Proven background contributing to well-known research publications and/or products in these areas
- PhD or degree in Computer Science, ML, or related field. 
- Proven experience with PyTorch, TensorFlow and modern DL techniques such as GANs, VAEs, diffusion or flow models, etc.
- Familiar with cloud-based technologies and production environments
 
What you'll get in return
 
- Benefits include a competitive salary, share options, unlimited PTO health coverage, and a VPO plan.
- Contributing to whitespace in speech technology research, you'll have control over the direction of your work with no friction whatsoever. You're the expert after all.
 
If you're looking to make an impact, there are few better places to do it. Your work here has the power to improve communication, eliminate confusion, and create a more connected world.
 
If you want the freedom to shape the future of speech AI, apply now.

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: USD $300,000.00
Job published: 26/02/2025
Job ID: 32723

Pioneer the future of visual AI creation

Looking to shape how millions experience generative visual content? A fast-growing AI startup seeks you, a vision generation researcher to advance their agentic approach to image and video synthesis technology.

This innovative team has already launched a successful product in the visual creativity space and is now expanding their research capabilities to drive the next phase of development.

Your expertise The ideal candidate brings research depth in at least one key area:

  • Advanced image synthesis - Creating high-fidelity visuals, exploring 3D neural rendering, or developing large vision models
  • Next-generation video AI - Building systems for fluid motion generation, temporal consistency, or efficient video creation pipelines
  • Cross-modal integration - Connecting visual generation with text, audio, or other input modalities

You thrive on implementing and improving state-of-the-art techniques, whether working with diffusion models, adversarial networks, or transformer-based architectures. Your technical foundation is matched by your creativity in pushing these technologies to new horizons.

Your impact You'll conduct foundational research that directly shapes product development, bridging the gap between academic innovation and real-world applications. Your work will enhance a platform already reaching a substantial user base, influencing how people interact with AI-generated visual content.

Your environment You'll collaborate with a close-knit, passionate team that values both technical excellence and creative vision.

Your package

  • Competitive salary: $200,000-$225,000 (negotiable based on experience)
  • Equity package
  • Comprehensive healthcare benefits
  • Additional perks and flexible work arrangements

Based in New York City with a hybrid work model, though remote arrangements will be considered for exceptional candidates.

Ready to transform the landscape of creative visual AI? Contact Marc Powell at techire.ai for a confidential discussion or submit your application today.

Location: New York, NY
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 26/02/2025
Job ID: 32690

A fast-growing AI audio startup is looking for a Lead Researcher to spearhead their next breakthrough: a text-to-music platform with advanced vocal synthesis capabilities. With over 5M active users and backing from music industry legends, they've proven their ability to deliver impactful AI audio tools at scale while generating substantial revenue.

Building on their commitment to ethically sourced and licensed data, this is a unique chance to lead the technical direction of a new proprietary platform from the ground up. You'll join a small but experienced team where you'll have true ownership and autonomy, with millions of existing users ready to adopt your innovations.

Core focus:

  • Lead the development of large-scale generative text-to-music models with advanced vocal synthesis
  • Pioneer novel approaches in AI singing and voice synthesis
  • Drive technical strategy and research direction
  • Build and lead the research team as it grows

Requirements:

  • Experience with large diffusion or autoregressive generative music model training
  • Experience with SOTA music generation techniques
  • Experience with vocal synthesis is a plus!

The role offers:

  • Comp DOE €130-€200k plus equity
  • Fully remote (overlap with 9am-2pm PST required)
  • Direct collaboration with the Co-Founders

If you're excited about building ethical, industry-changing technology with immediate real-world impact, we'd love to hear from you.

Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 07/01/2025
Job ID: 32291

Ready to own the complete ML stack while reshaping how Generative AI transforms advertising?

Join this start-up as Founding ML Engineer building a Generative AI based advertising platform.

You’ll contribute to a sophisticated solution that leverages consumer interaction data and generative models to create personalized ads across multiple modalities - from language to image and video generation.

Despite being only 5 people strong, they're already generating revenue and have secured seed funding from prestigious investors including a16z. Now they're ready to scale and tackle even more complex challenges with their AI-based system.

In this role, you'll be solving complex technical challenges such as implementing RLHF to enhance system intelligence, advanced prompting and fine-tuning of LLMs, and improving user efficiency. You'll have significant autonomy, owning the entire pipeline and keeping up to date with the state of the art to maintain top-level performance.

The role combines research, development, and experimentation, with approximately 20-30% focused on research, 40% on building and training models, and 30-40% on experimentation, performance improvement and deployments. While primarily focused on language/text, the team works across multiple modalities, so an interest in multimodal applications would be valuable.

Requirements:

  • 2+ years industry experience
  • Strong expertise in NLP and LLMs (prompting and fine-tuning)
  • Experience with RLHF implementations
  • Solid foundations in Deep Learning
  • Proven track record in building production ML systems

Experience with recommendation systems or ad tech would be beneficial but isn't essential.

Your package includes a base salary of up to $225k (negotiable based on experience), plus a 10% bonus, competitive equity, healthcare benefits, unlimited holiday and more. Total compensation will be adjusted relative to your level of experience.

The role is based on-site in San Francisco, with visa sponsorship available for exceptional candidates.

The interview process is straightforward:

  1. Initial call with recruiter (30 mins)
  2. Conversation with Founder (30 mins)
  3. Technical assessment or on-site interview
  4. Team meeting (can be combined with stage 3)

If you're excited about applying cutting-edge AI to transform the $600B advertising industry, we'd love to hear from you. All applicants will receive a response.

Location: San Francisco, CA
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 19/11/2024
Job ID: 32292