Research Engineer, Horizons

Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

London, UK

$250,000 - $340,000

Machine Learning

Senior Software Engineer

Hybrid

5+ years of experience

This job posting may no longer be active. You may be interested in these related jobs instead:

Description For Research Engineer, Horizons

Anthropic is seeking a Research Engineer to join their Reinforcement Learning Fundamentals team. In this role, you will collaborate with researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning. You'll work on improving reasoning abilities in areas such as code generation and mathematics, and explore reinforcement learning for agentic / open-ended tasks.

Key responsibilities include:

Developing and implementing novel reinforcement learning techniques to improve the performance and safety of large language models
Creating tools and environments for models to interact with, enabling them to perform complex, open-ended tasks
Designing and running experiments to enhance models' reasoning capabilities, particularly in code generation and mathematics

The ideal candidate will have:

5+ years of industry-related experience
Proficiency in Python and experience with deep learning frameworks such as PyTorch or Jax
Strong software engineering background
Passion for pair programming
Commitment to code quality, testing, and performance
A deep interest in the potential impact of AI and dedication to developing safe and beneficial systems

Strong candidates may also have:

Background in machine learning, reinforcement learning, or high performance computing
Experience with virtualization and sandboxed code execution environments
Experience with Kubernetes
Contributions to open-source projects or published research papers in relevant fields

Anthropic offers a competitive compensation package, including a salary range of £250,000 - £340,000 GBP, equity, and comprehensive benefits. The company has a hybrid work policy, expecting staff to be in one of their offices at least 25% of the time.

Join Anthropic in their mission to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society as a whole.

Last updated 9 months ago

Responsibilities For Research Engineer, Horizons

Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models
Create tools and environments for models to interact with, enabling them to perform complex, open-ended tasks
Design and run experiments to enhance models' reasoning capabilities, particularly in code generation and mathematics

Requirements For Research Engineer, Horizons

Python

Kubernetes

5+ years of industry-related experience
Proficient in Python and have experience with deep learning frameworks such as PyTorch or Jax
Strong software engineering background
Enjoy pair programming
Care about code quality, testing, and performance
Passionate about the potential impact of AI and are committed to developing safe and beneficial systems

Benefits For Research Engineer, Horizons

Medical Insurance

Dental Insurance

Vision Insurance

401k

Parental Leave

Education Budget

Commuter Benefits

Comprehensive health, dental, and vision insurance
401(k) plan with 4% matching
22 weeks of paid parental leave
Unlimited PTO
Stipends for education, home office improvements, commuting, and wellness
Fertility benefits via Carrot
Daily lunches and snacks in office
Relocation support