Your search has found 5 jobs

The future of communication should be bias and barrier-free. That's the vision behind this well-funded start-up pioneering real-time speech algorithms.
 
You'll join a research team on tech that is the first of its kind, improving how we communicate in the real world. By offering clear, natural-sounding conversations regardless of accent or environment. Their ground-breaking technology is already providing impressive results, so it's no wonder they're growing x4 annually.
 
They are creating the whitespace in speech research, and you'll play a key role.
 
As an Senior ML Scientist, you'll work within a talented R&D team advancing core speech algorithms and audio AI models.
 
The role
 
- Contribute to cutting-edge R&D advancing core speech algorithms and Generative Audio models. Continually push boundaries to the next level.
- Tackle unsolved problems in Generative Speech and Audio such as preserving naturalness and performance in noisy environments.
- End-to-end ownership of models, from data collection to training on the cloud.
- Develop novel architectures balancing cutting-edge performance with real-time efficiency & low-latency
- Collaborate with top scientists in this field 
 
You'll have
 
- 4+ years of industry experience developing and implementing either of the following: TTS, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Accent Translation, Speech Restoration
- Proven background contributing to well-known research publications and/or products in these areas
- PhD or degree in Computer Science, ML, or related field. 
- Proven experience with PyTorch, TensorFlow and modern DL techniques such as GANs, VAEs, diffusion or flow models, etc.
- Familiar with cloud-based technologies and production environments
 
What you'll get in return
 
- Benefits include a competitive salary, share options, unlimited PTO health coverage, and a VPO plan.
- Contributing to whitespace in speech technology research, you'll have control over the direction of your work with no friction whatsoever. You're the expert after all.
 
If you're looking to make an impact, there are few better places to do it. Your work here has the power to improve communication, eliminate confusion, and create a more connected world.
 
If you want the freedom to shape the future of speech AI, apply now.

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: USD $300,000.00
Job published: 26/02/2025
Job ID: 32723

Pioneer the future of visual AI creation

Looking to shape how millions experience generative visual content? A fast-growing AI startup seeks you, a vision generation researcher to advance their agentic approach to image and video synthesis technology.

This innovative team has already launched a successful product in the visual creativity space and is now expanding their research capabilities to drive the next phase of development.

Your expertise The ideal candidate brings research depth in at least one key area:

  • Advanced image synthesis - Creating high-fidelity visuals, exploring 3D neural rendering, or developing large vision models
  • Next-generation video AI - Building systems for fluid motion generation, temporal consistency, or efficient video creation pipelines
  • Cross-modal integration - Connecting visual generation with text, audio, or other input modalities

You thrive on implementing and improving state-of-the-art techniques, whether working with diffusion models, adversarial networks, or transformer-based architectures. Your technical foundation is matched by your creativity in pushing these technologies to new horizons.

Your impact You'll conduct foundational research that directly shapes product development, bridging the gap between academic innovation and real-world applications. Your work will enhance a platform already reaching a substantial user base, influencing how people interact with AI-generated visual content.

Your environment You'll collaborate with a close-knit, passionate team that values both technical excellence and creative vision.

Your package

  • Competitive salary: $200,000-$225,000 (negotiable based on experience)
  • Equity package
  • Comprehensive healthcare benefits
  • Additional perks and flexible work arrangements

Based in New York City with a hybrid work model, though remote arrangements will be considered for exceptional candidates.

Ready to transform the landscape of creative visual AI? Contact Marc Powell at techire.ai for a confidential discussion or submit your application today.

Location: New York, NY
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 26/02/2025
Job ID: 32690

A fast-growing AI audio startup is looking for a Lead Researcher to spearhead their next breakthrough: a text-to-music platform with advanced vocal synthesis capabilities. With over 5M active users and backing from music industry legends, they've proven their ability to deliver impactful AI audio tools at scale while generating substantial revenue.

Building on their commitment to ethically sourced and licensed data, this is a unique chance to lead the technical direction of a new proprietary platform from the ground up. You'll join a small but experienced team where you'll have true ownership and autonomy, with millions of existing users ready to adopt your innovations.

Core focus:

  • Lead the development of large-scale generative text-to-music models with advanced vocal synthesis
  • Pioneer novel approaches in AI singing and voice synthesis
  • Drive technical strategy and research direction
  • Build and lead the research team as it grows

Requirements:

  • Experience with large diffusion or autoregressive generative music model training
  • Experience with SOTA music generation techniques
  • Experience with vocal synthesis is a plus!

The role offers:

  • Comp DOE €130-€200k plus equity
  • Fully remote (overlap with 9am-2pm PST required)
  • Direct collaboration with the Co-Founders

If you're excited about building ethical, industry-changing technology with immediate real-world impact, we'd love to hear from you.

Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 07/01/2025
Job ID: 32291

Ready to own the complete ML stack while reshaping how Generative AI transforms advertising?

Join this start-up as Founding ML Engineer building a Generative AI based advertising platform.

You’ll contribute to a sophisticated solution that leverages consumer interaction data and generative models to create personalized ads across multiple modalities - from language to image and video generation.

Despite being only 5 people strong, they're already generating revenue and have secured seed funding from prestigious investors including a16z. Now they're ready to scale and tackle even more complex challenges with their AI-based system.

In this role, you'll be solving complex technical challenges such as implementing RLHF to enhance system intelligence, advanced prompting and fine-tuning of LLMs, and improving user efficiency. You'll have significant autonomy, owning the entire pipeline and keeping up to date with the state of the art to maintain top-level performance.

The role combines research, development, and experimentation, with approximately 20-30% focused on research, 40% on building and training models, and 30-40% on experimentation, performance improvement and deployments. While primarily focused on language/text, the team works across multiple modalities, so an interest in multimodal applications would be valuable.

Requirements:

  • 2+ years industry experience
  • Strong expertise in NLP and LLMs (prompting and fine-tuning)
  • Experience with RLHF implementations
  • Solid foundations in Deep Learning
  • Proven track record in building production ML systems

Experience with recommendation systems or ad tech would be beneficial but isn't essential.

Your package includes a base salary of up to $225k (negotiable based on experience), plus a 10% bonus, competitive equity, healthcare benefits, unlimited holiday and more. Total compensation will be adjusted relative to your level of experience.

The role is based on-site in San Francisco, with visa sponsorship available for exceptional candidates.

The interview process is straightforward:

  1. Initial call with recruiter (30 mins)
  2. Conversation with Founder (30 mins)
  3. Technical assessment or on-site interview
  4. Team meeting (can be combined with stage 3)

If you're excited about applying cutting-edge AI to transform the $600B advertising industry, we'd love to hear from you. All applicants will receive a response.

Location: San Francisco, CA
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 19/11/2024
Job ID: 32292

Are you looking to lead impactful advancements in generative audio?

This fast-growing AI voice start-up is seeking an exceptional Senior ML Researcher to drive strategy and cutting-edge R&D for their transformative voice products.

Leveraging AI techniques, their technology empowers millions of users with powerful tools for vocal audio creation and customisation.

In this vital role, you'll provide strategic leadership and help shape the foundations for their new ML research organisation. Utilising your extensive background in speech technology, generative modelling, and audio ML, you'll spearhead advancements into new generative audio frontiers.

Your core responsibilities will include:

  • Defining and providing long-term research strategy development to unlock novel generative speech and audio capabilities
  • Executing the ML research roadmap to enhance existing products
  • Collaborating closely with an experienced engineering team skilled at productionising ML, allowing you to focus on pioneering research
  • Helping build and grow the ML Research team

You'll bring:

  • 4+ years of experience leading impactful ML research for speech synthesis, speech-to-speech, TTS, voice conversion or related speech/audio domains
  • Extensive hands-on expertise with state-of-the-art models & techniques for speech and audio synthesis such as Variational Temporal Autoencoders, self-supervised models like ContentVec/Hubert etc.
  • Proven experience launching impactful research initiatives and strategies within the audio/speech domain.

The opportunity offers:

  • Competitive salary (up to $200k depending on experience) and stock options
  • Fully remote with flexible working hours
  • Annual team offsites to exciting global destinations

 

This is a rare chance to drive cutting-edge generative audio AI research. If pushing the boundaries of voice technology excites you, apply now!

Location: Remote - Global
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 21/05/2024
Job ID: 32252