Job title:	Research Scientist - RL Simulated Environments
Job type:	Permanent
Emp type:	Full-time
Industry:	AI Agents
Salary type:	Annual
Salary:	negotiable
Location:	San Francisco
Job published:	12/03/2026
Job ID:	33119

Job Description

Advance how agents and LLMs learn from feedback in realistic environments.

If you've been working at the intersection of reinforcement learning and large language models, this is an opportunity to work on the foundations of how AI systems are trained, evaluated, and supervised, with your research shipping into production.

You will work hands-on on fundamental problems spanning LLM post-training, RL simulation environments, and agentic evaluation, shaping core methods and benchmarks used by leading AI labs and enterprises around the world.

The team actively publishes and collaborates with external research labs, with recent work appearing at ACL and NeurIPS. You'll see your ideas move from concept to deployed systems, working alongside engineers who build fast and take research seriously.

This is a research-driven company growing quickly due to real demand for what they're building. If you want your work to matter, both in the literature and in production, this is where to do it.

You'll bring hands-on experience in applied research across RL, LLM post-training, or agent-based systems, with a strong understanding of transformer architectures and fine-tuning. As important as the theory is the ability to ship — you can translate research ideas into production-ready systems that actually work. A track record of publishing at top-tier venues such as NeurIPS, ICML, ACL, or EMNLP is a plus, but what matters most is the quality of your thinking and your ability to execute.

What you'll do

Conduct research on LLM post-training methods (RLHF, RLAIF, RLVR)
Design and build realistic RL simulation environments for agents
Develop agentic evaluation and supervision frameworks
Create and maintain benchmarks for emerging AI capabilities
Collaborate with engineers to take research from idea to deployed systems

Location — San Francisco · Salary — Up to $300k base + equity, flexible and negotiable DOE

All applicants will receive a response.

Location:	San Francisco, CA
Job type:	Permanent
Emp type:	Full-time
Salary type:	Annual
Salary:	negotiable
Job published:	30/03/2026
Job ID:	35569

Job Description

Our use of cookies