Pioneer groundbreaking research in AI agentic supervision
Want to develop the world's first agentic AI systems that can autonomously supervise and evaluate other AI? This Research Scientist role offers the chance to tackle fundamental challenges in AI supervision, red teaming, and automated evaluation at an unprecedented scale.
You'll join a company pioneering intelligent systems that evaluate advanced AI applications. Their technology, trusted by OpenAI, HP, and Fortune 100 companies, represents the cutting edge of scalable AI oversight. Despite being well-funded with $20M in backing, they're already generating significant ARR - proving real market demand for their solutions. Their evaluators outperform AI industry leaders internal solution by 18% in hallucination detection, and they've developed industry benchmarks used across multiple sectors.
The role focuses on research (50%) developing state-of-the-art systems for autonomous AI supervision, pushing beyond traditional approaches to create truly agentic evaluation frameworks. You'll conduct SOTA research, fine-tune LLMs, implement novel algorithms, conduct groundbreaking research on red teaming, and design experiments that advance the field of automated AI oversight. Here, you'll publish with the team in conjuction with top AI research labs and academic organisations, the role is 50% research and 50% implementation.
The ideal candidate has:
- Strong experience in Applied AI NLP research
- Publications at leading AI conferences (NeurIPS, ICML, EMNLP, ACL, ICLR)
- Experience fine-tuning LLMs (must-have)
- Deep understanding of transformer architectures and evaluation metrics
- Big Tech experience (highly desirable)
- Experience in evals, LLM as judges, or red teaming
The company works with both startups and Fortune 100 enterprises. Their team has published at top ML conferences and built AI products at leading tech companies.
This role offers competitive compensation ($200k-$250k base - negotiable), significant stock options, full benefits, 401k, and unlimited PTO. Location preference for San Francisco Bay Area. Relocation assistance available. Remote considered for industry-leading researchers in evals.
Interested in pioneering the future of autonomous AI supervision?
Location: | San Francisco Bay Area, |
---|---|
Job type: | Permanent |
Emp type: | Full-time |
Salary type: | Annual |
Salary: | negotiable |
Job published: | 24/04/2025 |
Job ID: | 33119 |