Research Engineer, Horizons

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.
$250,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI

Description For Research Engineer, Horizons

Anthropic is seeking a Research Engineer to join their Reinforcement Learning Fundamentals team. In this role, you will collaborate with researchers and engineers to advance the capabilities and safety of large language models through fundamental research in reinforcement learning. You'll work on improving reasoning abilities in areas such as code generation and mathematics, and explore reinforcement learning for agentic / open-ended tasks.

Key responsibilities include:

  • Developing and implementing novel reinforcement learning techniques to improve the performance and safety of large language models
  • Creating tools and environments for models to interact with, enabling them to perform complex, open-ended tasks
  • Designing and running experiments to enhance models' reasoning capabilities, particularly in code generation and mathematics

The ideal candidate will have:

  • 5+ years of industry-related experience
  • Proficiency in Python and experience with deep learning frameworks such as PyTorch or Jax
  • Strong software engineering background
  • Passion for pair programming
  • Commitment to code quality, testing, and performance
  • A deep interest in the potential impact of AI and dedication to developing safe and beneficial systems

Strong candidates may also have:

  • Background in machine learning, reinforcement learning, or high performance computing
  • Experience with virtualization and sandboxed code execution environments
  • Experience with Kubernetes
  • Contributions to open-source projects or published research papers in relevant fields

Anthropic offers a competitive compensation package, including a salary range of £250,000 - £340,000 GBP, equity, and comprehensive benefits. The company has a hybrid work policy, expecting staff to be in one of their offices at least 25% of the time.

Join Anthropic in their mission to create reliable, interpretable, and steerable AI systems that are safe and beneficial for users and society as a whole.

Last updated 2 months ago

Responsibilities For Research Engineer, Horizons

  • Develop and implement novel reinforcement learning techniques to improve the performance and safety of large language models
  • Create tools and environments for models to interact with, enabling them to perform complex, open-ended tasks
  • Design and run experiments to enhance models' reasoning capabilities, particularly in code generation and mathematics

Requirements For Research Engineer, Horizons

Python
Kubernetes
  • 5+ years of industry-related experience
  • Proficient in Python and have experience with deep learning frameworks such as PyTorch or Jax
  • Strong software engineering background
  • Enjoy pair programming
  • Care about code quality, testing, and performance
  • Passionate about the potential impact of AI and are committed to developing safe and beneficial systems

Benefits For Research Engineer, Horizons

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
Commuter Benefits
  • Comprehensive health, dental, and vision insurance
  • 401(k) plan with 4% matching
  • 22 weeks of paid parental leave
  • Unlimited PTO
  • Stipends for education, home office improvements, commuting, and wellness
  • Fertility benefits via Carrot
  • Daily lunches and snacks in office
  • Relocation support

Interested in this job?

Jobs Related To Anthropic Research Engineer, Horizons

Research Scientist/Engineer - AI Safety (Biosecurity)

Join Anthropic's team to research and mitigate extreme risks from future AI models, focusing on biosecurity.

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models.

Software Engineer

Senior Software Engineer role at Anthropic, working on large-scale ML systems for safe and beneficial AI.

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models.