Job title: ML Researcher - LLMs, Reinforcement learning
Job type: Permanent
Emp type: Full-time
Industry: Artificial Intelligence & Machine Learning
Salary type: Annual
Salary: negotiable
Location: London, UK
Job published: 19/07/2024
Job ID: 32261

Job Description

Are you looking for your next position to combine real impact with the latest language generation technology?

 

Join this tech-for-good start-up as an ML Research Engineer, building the first foundation model in their domain, solving an already large and growing challenge across the world.

 

Alongside their foundation model and first-of-its-kind dataset, which serve their B2B market, they also have a B2C consumer app, which currently has no competitors. So far their solution has been well received both by those in the industry and by people looking to use this type of technology to improve their lives and wellbeing.

 

You’ll work on unique technology with impact, building proprietary tech from the ground up and leveraging the latest language models (GPT-4o and Llama). You’ll solve challenges on their core models, and your work will have a direct impact on making their models smarter and their language technology more natural. Their unique dataset is constantly collecting new data, so there will naturally be some data collection, curation and processing involved. Some model orchestration and optimisation is required, and they have a strong partnership with OpenAI for this.

 

You’ll be pre-training (including continued pre-training) and fine-tuning LLMs, including large models of up to 70 billion parameters. Your previous experience must include, at the very least, fine-tuning LLMs; prior pre-training experience is desirable. The usual Python and PyTorch experience is expected, along with Deep Learning experience in the language domain.

 

The role is made up of around 40% research and experimentation and 50% ML engineering (model building, training, fine-tuning and pre-training); the remaining 10% will be focused on deployment and inference challenges.

 

Your research work will be on SOTA Deep Learning, LLMs / Transformers and novel approaches in reinforcement learning for alignment challenges. You will already be familiar with multi-GPU model training.

 

This start-up is moving fast; if you have experience in start-ups or start-up-style environments, you’ll fit right in.

 

This is a hybrid role, in London for 2-3 days per week. Relocation support can be provided if you’d like to move to London.

 

The total compensation will be somewhere in the region of £200,000 - £350,000 for the ML Researcher. For an experienced ML Lead, it would be up to circa £500,000. This will be a mixture of base salary and stock options, along with the usual benefits you’d expect.

 

Apply now for immediate consideration.

 
