Your search has found 23 jobs

Ready to build speech AI that actually works in real-time?

A well-funded AI startup has developed new model architectures that make real-time conversational AI finally viable at scale. While most voice AI still suffers from delays and computational bottlenecks, they've solved the core efficiency problems that have held the field back. 

The role

As their Speech Research Scientist, you'll build the speech models that could define the next decade of voice interaction. You'll work on novel architectures that have immediate real-world impact for thousands of customers.

What you'll do

  • Design and implement SOTA speech synthesis models
  • Develop efficient algorithms for voice processing and audio understanding
  • Create scalable systems that handle massive audio workloads
  • Build comprehensive evaluation methods to validate model performance
  • Collaborate with engineering teams to transition research into production

What you'll bring

  • Deep expertise in modern speech technologies (Text-to-Speech, Speech LLMs, Voice Conversion/Cloning, Speech Synthesis, Speech Translation, Speech Restoration)
  • Strong background in generative modeling for audio and speech
  • Publications at leading conferences
  • Track record of implementing research ideas from concept to production
This role is based in the Bay Area.
 
If you're excited about building the foundational models that will power the Voice AI revolution, we'd love to hear from you.
 
Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 05/06/2025
Job ID: 33251

Build the ML infrastructure that powers cutting-edge AI across multiple domains

Ready to architect MLOps systems from the ground up for a fast-growing AI team? This greenfield opportunity offers complete autonomy to design and build training pipelines for LLMs, computer vision models, and other deep learning architectures that will power next-generation AI applications.

You'll join a well-funded startup ($20M+ raised, with a new round expected this year) developing production-grade AI solutions across regulated industries including healthcare, aerospace and manufacturing. Founded by a successful entrepreneur with a previous billion-dollar exit, they're already partnering with Fortune 100 and 500 clients where standard AI approaches fall short.

This role offers exceptional technical ownership - you'll build their ML infrastructure from current basic tooling to production-scale systems that support their rapidly expanding applied AI team. They have significant GPU resources with substantial budget growth expected. As the team scales to ~20 people within the year, there's high potential for you to lead future MLOps hires.

The challenge is substantial: creating infrastructure that supports training across multiple modalities - from LLMs to computer vision models. You'll work with large compute resources and have complete autonomy to select and implement the tooling that will define how the team operates for years to come. Your initial focus will be establishing robust training and evaluation pipelines, then scaling to enterprise-grade data workflows with versioning, monitoring, and automated deployment systems.

Your focus:

  • Build training and evaluation pipelines for LLMs, vision models, and other deep learning architectures
  • Design distributed training systems on multi-GPU clusters across model types
  • Create scalable data pipelines, versioning systems, and model checkpointing workflows
  • Implement model serving infrastructure with tools like vLLM, Triton, and TorchServe
  • Establish comprehensive monitoring, experiment tracking, and reproducibility systems
  • Support a rapidly growing applied AI team with robust CI/CD workflows for ML systems

You should have:

  • 3+ years building MLOps infrastructure or ML systems in production environments
  • Hands-on experience with training pipelines for deep learning models (LLMs, CNNs, transformers)
  • Strong expertise with AWS and Kubernetes (mandatory requirements)
  • Proficiency with Python, PyTorch/TensorFlow, and distributed training libraries
  • Experience with model tracking tools like Weights & Biases or MLflow
  • Understanding of modern ML architectures across multiple domains

Nice to have:

  • Experience with LLM inference tools (vLLM, SGLang, RayServer)
  • Ray experience for distributed computing
  • Knowledge of mixed-precision training, quantisation, and model optimisation
  • Computer vision workflow experience
  • Data versioning tools (DVC, LakeFS)
  • Early-stage startup experience

You'll receive:

  • Competitive base salary: circa $250K (based on experience)
  • Significant stock package in a fast-growing company
  • Access to substantial GPU budget with expected growth
  • Healthcare (medical, dental, vision) and 401k with matching
  • 20 vacation days plus flexible working arrangements

You must be based in SF Bay Area or Miami (relocation is provided to Florida only). At this time we can only consider US citizens or green card holders.nd.

Ready to build the infrastructure that powers the future of production AI? All applicants will receive a response.

Location: United States
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 05/06/2025
Job ID: 33218

Ready to build AI agents that actually solve real-world problems?

Join a well-funded AI startup developing the next generation of agentic AI systems - general-purpose models capable of solving real-world user problems end-to-end with minimal prompting.

They're building an agentic AI desktop assistant that integrates deeply with operating systems, positioning as a potential competitor to OpenAI's operator. With over $40 million in funding, they're pushing the boundaries of what's possible in LLMs, reinforcement learning, and autonomous AI systems.

This role offers the rare opportunity to work on foundational AI research that directly translates into practical applications. You'll be building technology that transforms how people interact with computers, creating AI agents that don't just talk, but actually perform complex tasks autonomously.

 

Your focus:

You'll join the foundational AI research team tackling some of the most challenging problems in AI today. Your work will span from theoretical research to practical implementation, with the goal of building core models that surpass existing capabilities while maintaining highest accuracy and longest possible context windows.

As a Research Engineer, you'll design and implement cutting-edge LLM architectures specifically for agentic behaviour and real-world task solving. You'll own components of the RL training pipeline, including reward model development, evaluation, and deployment. Working with high-quality datasets for supervised fine-tuning and RL will be central to your role, as will experimenting with novel training algorithms for agentic systems.

You'll contribute to benchmark design and evaluation strategies for both reward models and policy models, staying at the frontier of LLM, RL, and agent research whilst turning theory into scalable systems that can be deployed at scale.

 

You should have:

A deep understanding of LLM post-training, fine-tuning, and evaluation, especially in the context of agents. Experience designing or training reward models for alignment or task optimisation is essential, along with hands-on experience implementing RLHF, DPO, or similar methods to align language models with human feedback.

You'll need strong intuition for dataset quality - knowing how to spot problematic data and craft effective supervision. Fluency with PyTorch, Hugging Face, and modern LLM libraries is crucial, as is the ability to prototype and scale new ideas quickly.

The ideal candidate can implement novel training algorithms, architectures, or evaluation strategies from research papers or original ideas, bringing both theoretical understanding and practical engineering skills to complex AI challenges.

Nice to have:

Experience scaling LLMs beyond 10B parameters would be valuable, as would prior research experience in LLMs, RL, or reward learning. Publications in relevant venues are appreciated but practical implementation experience is equally valued.

 

What they offer:

This is an opportunity to work with a top-tier team of researchers and engineers building aligned, general AI systems that are truly helpful and deployable. Your work will directly shape how AI interacts with the real world, moving beyond conversational interfaces to systems that perform meaningful tasks.

The compensation reflects the senior nature of the role: $200k-$250k base salary (negotiable based on experience) plus significant equity in a well-funded company positioned at the forefront of agentic AI. They provide comprehensive benefits and visa sponsorship for exceptional candidates.

Office based near Palo Alto, they're looking for energetic, fast-paced individuals who bring fresh excitement to the space and want to push the boundaries of what's possible with autonomous AI systems.

If you're excited about building AI agents that move beyond conversation to actual task completion, this could be your opportunity to make a lasting impact on the future of human-computer interaction.

Ready to help define the next generation of AI? All applicants will receive a response.

Location: Palo Alto, CA
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 29/05/2025
Job ID: 32459

Build agentic AI for industries where decisions truly matter - from healthcare to aerospace

Ready to create AI agents that solve real-world challenges in regulated environments? Join a well-funded startup developing secure, explainable AI systems for industries where standard approaches fall short.

You'll work as an AI Engineer in the Applied Science team, taking AI models from research to production and integrating agentic AI systems into real applications. This engineering-focused role involves building and deploying AI automation tools that tackle complex industry workflows across healthcare, aerospace, defence, and manufacturing.

Founded by a successful entrepreneur with previous billion-dollar exits, this company has raised $20M+ and is already partnering with Fortune 500 clients in regulated STEM fields. They're assembling a world-class team to create the next generation of secure, transparent AI agents.

The role offers unique technical challenges in agentic systems for enterprise environments, with access to significant compute resources and the opportunity to shape the direction of a fast-growing team in a no-ego culture focused on building working solutions.

Your focus:

  • Develop AI-driven reasoning agents and frameworks for automating complex tasks
  • Build and optimise RAG (Retrieval-Augmented Generation) pipelines
  • Distill and fine-tune AI models to improve reasoning, automation, and decision-making
  • Deploy AI models into scalable, production-ready systems using Python and cloud infrastructure
  • Collaborate with researchers to operationalise AI advancements into real applications
  • Work with complex datasets (proprietary, customer, and synthetic data)

You should have:

  • Proven experience building and deploying AI applications (ideally in a startup or fast-moving environment)
  • Hands-on expertise with LLMs and agentic frameworks (AutoGen, CrewAI, LangChain, DSPy, or Haystack)
  • Strong background in AI-powered automation and orchestration workflows
  • Experience with vector databases and retrieval systems (LlamaIndex, FAISS, Pinecone, Weaviate)
  • Solid coding skills in Python and familiarity with cloud deployment tools (AWS, GCP, or Azure)

Nice to have:

  • MLOps experience and production ML system deployment
  • Experience with fine-tuning AI models

You'll receive:

  • Competitive base salary: $160K-$230K (negotiable based on experience)
  • Up to 20% performance bonus
  • Significant options package
  • Healthcare (medical, dental, vision) 
  • 401k with up to 3% match 
  • 20 vacation days plus 10 sick days with flexible working hours
  • Relocation allowance for moves to Florida

You'll be already be based in SF Bay Area or willing to relocate to Miami, Florida, with relocation support provided. The team values clear communication, hands-on building, and collaborative problem-solving in an environment where your work directly impacts the product.

This is an ideal opportunity for an ML Engineer who has built AI applications before and wants to be part of a small, fast-moving team tackling autonomous workflows that solve real problems in mission-critical environments.

Ready to help shape computing intelligence that amplifies human innovation? All applicants will receive a response.

Location: United States
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 29/05/2025
Job ID: 32921

Pioneer groundbreaking research in AI agentic supervision

Want to develop the world's first agentic AI systems that can autonomously supervise and evaluate other AI? This Research Scientist role offers the chance to tackle fundamental challenges in AI supervision, red teaming, and automated evaluation at an unprecedented scale.

You'll join a company pioneering intelligent systems that evaluate advanced AI applications. Their technology, trusted by OpenAI, HP, and Fortune 100 companies, represents the cutting edge of scalable AI oversight. Despite being well-funded with $20M in backing, they're already generating significant ARR - proving real market demand for their solutions. Their evaluators outperform AI industry leaders internal solution by 18% in hallucination detection, and they've developed industry benchmarks used across multiple sectors.

The role focuses on research (50%) developing state-of-the-art systems for autonomous AI supervision, pushing beyond traditional approaches to create truly agentic evaluation frameworks. You'll conduct SOTA research, fine-tune LLMs, implement novel algorithms, conduct groundbreaking research on red teaming, and design experiments that advance the field of automated AI oversight. Here, you'll publish with the team in conjuction with top AI research labs and academic organisations, the role is 50% research and 50% implementation. 

The ideal candidate has:

  • Strong experience in Applied AI NLP research
  • Publications at leading AI conferences (NeurIPS, ICML, EMNLP, ACL, ICLR)
  • Experience fine-tuning LLMs (must-have)
  • Deep understanding of transformer architectures and evaluation metrics
  • Big Tech experience (highly desirable)
  • Experience in evals, LLM as judges, or red teaming

The company works with both startups and Fortune 100 enterprises. Their team has published at top ML conferences and built AI products at leading tech companies.

This role offers competitive compensation ($200k-$250k base - negotiable), significant stock options, full benefits, 401k, and unlimited PTO. Location preference for San Francisco Bay Area. Relocation assistance available. Remote considered for industry-leading researchers in evals.

Interested in pioneering the future of autonomous AI supervision?

Location: San Francisco Bay Area,
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 24/04/2025
Job ID: 33119

Lead AI research that shapes how AI evaluates AI

 

Want to pioneer AI supervision systems that ensure powerful models can be safely trusted? This leadership role offers a rare chance to solve one of AI's most critical challenges: making AI systems that can effectively evaluate other AI.

 

You'll join a well-funded startup backed by top Silicon Valley VCs and led by a team with elite ML research backgrounds from FAIR, Meta Reality Labs, and quant finance. Their technology is already trusted by OpenAI, HP, and Fortune 100 companies across education, finance, and healthcare.

 

As Head of AI, you'll lead a team building cutting-edge evaluation systems that leverage memory, long-context reasoning, and multimodal capabilities. Your research won't just advance the field – it will be immediately implemented in products that solve real enterprise AI safety challenges.

 

The team has published at top ML conferences (NeurIPS, EMNLP, ACL) and developed models and benchmarks used by leading AI companies worldwide. Their evaluators already outperform OpenAI's by 18% in hallucination detection.

 

Your focus:

  • Lead a research and ML engineering team developing state-of-the-art AI supervision systems
  • Solve open research problems in evaluation, explainability, and robustness
  • Set research vision alongside the CTO and establish rigorous research processes
  • Guide development of novel benchmarks for SOTA AI systems
  • Represent the company through publications, speaking, and industry relationships
  • Build and grow a world-class technical team

You should have:

  • PhD in Computer Science, Mathematics, Statistics, Linguistics or related field
  • Strong publication record at top AI conferences (NeurIPS, ICML, EMNLP, ACL)
  • Experience conducting empirical NLP research in academic or industry settings
  • Deep knowledge of transformer architectures and evaluation frameworks
  • Experience training language models in applied or research contexts
  • Ability to communicate complex technical concepts across different audiences

The package:

  • Competitive salary ($300K-$350K base) 
  • Meaningful equity in a well-funded startup
  • Performance bonus
  • Full health, dental and vision coverage
  • 401k plan
  • Unlimited PTO
  • Regular global team off site days

Location preference for NYC or SF, with flexibility for exceptional candidates.

If you're passionate about ensuring advanced AI systems can be effectively supervised and evaluated, this role offers the chance to make a significant impact on the future of AI safety and deployment.

Location: San Francisco Bay Area,
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 17/04/2025
Job ID: 33020

Do you want to create AI that converses as naturally as humans do? A pioneering healthtech unicorn is building AI digital health agents designed to safely and empathetically assist patients.

As the Staff Research Scientist, you'll play a key part in making this a reality - building end-to-end foundational speech models capable of full-duplex communication. This isn't just about taking turns speaking; it's about creating AI that can listen and respond simultaneously with human-like conversation, emotions, and natural language that healthcare demands.

What you'll do

  • Design and develop novel speech foundation models for healthcare conversations, working end-to-end from research through to productionizing models
  • Work on post-training LLMs for speech to enhance their conversational capabilities
  • Tackle unique challenges including response time optimization, maintaining alignment between text and speech outputs, and operating in noisy environments
  • Create innovative approaches to synthetic conversational data generation
  • Have the opportunity to publish your groundbreaking research

What you'll bring

  • PhD with 8+ years in speech technologies or related field
  • Experience with Speech LLMs
  • Experience training large datasets
  • Strong publication record at top-tier conferences in speech/multimodal AI
  • Ability to implement research papers from scratch

Bonus points for

  • Experience pre-training foundation models with speech (HuBERT, Wav2Vec, or similar)
  • Multimodal experience
  • Experience with inference technologies (vLLM, CUDA)

You'll be based in the Bay Area and will receive highly competitive comp (up to $350K base DOE) with substantial equity.

If you're excited about creating the next generation of speech AI that will revolutionize healthcare communication, click apply!

Location: Bay Area
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 16/04/2025
Job ID: 33086

Shape the future of agentic AI through cutting-edge data strategy

Want to pioneer next-generation data techniques for advanced AI systems? This role combines frontier model research with practical implementation at one of Europe's most ambitious AI startups.

You'll join a rapidly growing AI Data team developing cutting-edge data-centric approaches that enhance LLMs, VLMs, and Action Models. This isn't just about collecting data – it's about transforming how AI systems learn and operate through synthetic generation, model distillation, and preference alignment.

Founded with a clear mission to push the boundaries of superintelligent agentic AI, this well-funded startup ($200M raised) is assembling world-class talent focused on both advancing capabilities and ensuring responsible development. Their approach is comprehensive – building proprietary technology from data to models, focusing on language, multimodal, and vision systems with superior performance and cost-effectiveness.

As an Applied Engineer focusing on Data Research, you'll develop sophisticated data strategies that directly impact frontier AI systems:

  • Generate and augment synthetic multimodal datasets for VQA, agent behaviours, and virtual navigation
  • Apply model distillation techniques to optimise large-scale models for edge deployment
  • Design evaluation frameworks to measure improvements across multiple domains
  • Lead research into aligning data with human and AI preferences
  • Collaborate with cross-functional teams to integrate data-driven solutions

This role offers rare access to significant compute resources, with a massive GPU cluster that enables cutting-edge work. You'll be joining at a pivotal stage where your contributions will shape core technology and direction.

Requirements:

  • Strong Python programming skills covering parallel computing, system design, and large-scale deployments
  • Experience developing multimodal data pipelines
  • Background in training and deploying LLMs, VLMs or PyTorch models
  • MSc or PhD in machine learning, computer vision, NLP, or related field
  • Deep understanding of training and evaluation paradigms for multimodal models
  • Effectiveness in fast-changing environments

Nice to have:

  • Experience with agent-specific data pipelines
  • Background in multimodal human annotation platforms
  • Document understanding/OCR expertise
  • Synthetic data generation experience (particularly multimodal)

You'll have flexibility to work from New York, London, or remotely within European or US East Coast time zones. For those based in cities with offices, hybrid arrangements are available.

Your package includes a highly competitive salary ($200,000-$350,000 depending on experience) plus significant equity with strong upside potential.

If you're passionate about advancing AI through innovative data approaches and want to make a lasting impact on agentic systems, we'd love to hear from you. All applicants will receive a response.

Location: NYC or London
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 15/04/2025
Job ID: 33152

Ready to drive the next breakthrough in speech AI?

This rapidly growing, well-funded tech startup is redefining speech technology with industry-leading accuracy and latency. Their voice AI platform is already outperforming established competitors while supporting languages traditionally overlooked by mainstream technologies.

With a strong client base and significant investment secured, they're now at an exciting inflection point - focusing on building proprietary speech AI systems from the ground up. This presents a rare opportunity to shape the future of multilingual speech technology.

The role

As Lead Speech Research Engineer, you'll direct the technical vision for their next phase of innovation. You'll have the freedom to make foundational architectural decisions and establish new research directions.

You'll work at the intersection of cutting-edge research and practical implementation, with the autonomy to build solutions that push the boundaries of what's possible in speech AI.

What makes this opportunity special

  • You'll define research strategy rather than implementing someone else's vision
  • Build something entirely new with technical freedom
  • Directly influence product direction and company growth
  • See your innovations impact real-world applications
  • Opportunities to attend conferences and publish research in the future

What you'll do

  • Shape the research vision and technical architecture for next-generation speech models
  • Develop E2E speech models with proprietary solutions that push performance boundaries
  • Guide a growing R&D team on novel approaches and architectural innovations
  • Create systems that raise the bar for accuracy, latency, and multilingual performance
  • Establish best practices for data preparation and model optimization

What you'll bring

  • 8+ years of experience developing ML systems for audio applications
  • Proven success building speech models end-to-end, not just fine-tuning existing ones
  • Experience with large-scale model training and optimization
  • 1st author publications at top-tier speech/audio conferences 
  • Deep understanding of modern speech processing techniques and architectures

You'll get

  • To work remotely (in EU or US Eastern timezone) or hybrid in Europe. You'll also get to meet periodically with for team gatherings in Europe.
  • Salary up to €200K DOE
  • Unlimited PTO and comprehensive health coverage
  • Generous pension plan and lifestyle benefits

If you're passionate about pushing the boundaries of speech AI and want the freedom to build something revolutionary, this is your opportunity to make a significant impact. Get in touch today!

Location: Remote
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 11/04/2025
Job ID: 33053

Build secure, deployable AI systems for industries where precision matters

Want to implement cutting-edge AI solutions for high-stakes industries? This role combines ML engineering expertise with real-world impact in aerospace, healthcare, manufacturing and defence.

You'll join a small but fast-growing engineering team at a well-funded startup transforming how regulated industries leverage AI. Founded by successful entrepreneurs with billion-dollar exits, they've secured $20M and are already working with Fortune 500 clients where standard AI approaches fall short.

This role offers a unique chance to work on AI challenges with meaningful real-world impact, with access to abundant compute resources for complex development. You'll help shape technical direction as an early team member, balancing practical implementation with cutting-edge ML research across multiple high-value industries in a no-ego culture focused on building working solutions.

As an ML Engineer, you'll bridge the gap between research and practical implementation, ensuring sophisticated AI models work reliably in mission-critical environments.

Your focus:

  • Implementing and optimizing models for specific client needs across regulated industries
  • Building models (NLP or Computer Vision - depending on your background)
  • Working with LLMs, such as API integrations, fine-tuning and prompting
  • Developing client-specific models and adapting solutions using transfer learning
  • Conducting evaluations and handling customer data onboarding
  • Collaborating with the Applied AI team to operationalize research breakthroughs

You should have:

  • 5+ years of experience in ML or software engineering with ML focus
  • Good practical experience in PyTorch or TensorFlow
  • Experience deploying ML models in production environments
  • Solid AWS knowledge (or comparable cloud platform expertise)
  • Python proficiency and familiarity with ML deployment challenges

Nice to have:

  • Computer vision or NLP expertise
  • Experience in regulated industries (healthcare, aerospace, defence)
  • Experience with Kubernetes
  • Prior work with MLOps tooling and practices
  • Science/engineering degree (Masters preferred)

They offer:

  • Competitive salary ($180k-$225k depending on experience) plus bonus potential
  • Significant equity package
  • Comprehensive healthcare (medical, dental, vision)
  • 401k contributions
  • 20 days vacation and flexible working arrangements

You'll need to be based in the US – Seattle or Miami (attractive relocation package provided for Florida).

If building secure, practical AI systems for industries where decisions truly matter appeals to you, reach out for a confidential conversation. All applicants will receive a response.

 
Location: Fort Lauderdale / Miami, FL
Job type: Permanent
Emp type: Full-time
Salary type: Annual
Salary: negotiable
Job published: 27/03/2025
Job ID: 32888