Machine Learning Systems Engineer, RL Engineering

Anthropic

Anthropic's mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole.

San Francisco, CA, USA • New York, NY, USA • Seattle, WA, USA

$300,000 - $425,000

Machine Learning

Mid-Level Software Engineer

Hybrid

2+ years of experience

Description For Machine Learning Systems Engineer, RL Engineering

You want to build the cutting-edge systems that train AI models like Claude. You're excited to work at the frontier of machine learning, implementing and improving advanced techniques to create ever more capable, reliable and steerable AI. As an ML Systems Engineer on our Reinforcement Learning Engineering team, you'll be responsible for the critical algorithms and infrastructure that our researchers depend on to train models. Your work will directly enable breakthroughs in AI capabilities and safety. You'll focus obsessively on improving the performance, robustness, and usability of these systems so our research can progress as quickly as possible. You're energized by the challenge of supporting and empowering our research team in the mission to build beneficial AI systems.

Our finetuning researchers train our production Claude models and internal research models using RLHF and related methods. Your job will be to build, maintain, and improve the algorithms and systems that these researchers use to train models. You'll be responsible for improving the speed, reliability, and ease of use of these systems.

You may be a good fit if you:

Have 2+ years of software engineering experience
Like working on systems and tools that make other people more productive
Are results-oriented, with a bias towards flexibility and impact
Pick up slack, even if it goes outside your job description
Enjoy pair programming (we love to pair!)
Want to learn more about machine learning research
Care about the societal impacts of your work

Strong candidates may also have experience with:

High performance, large scale distributed systems
Kubernetes
Python
Implementing LLM finetuning algorithms, such as RLHF

Representative projects:

Profiling our reinforcement learning pipeline to find opportunities for improvement
Building a system that regularly launches training jobs in a test environment so that we can quickly detect problems in the training pipeline
Making changes to our finetuning systems so they work on new model architectures
Building instrumentation to detect and eliminate Python GIL contention in our training code
Diagnosing why training runs have started slowing down after some number of steps, and fixing it
Implementing a stable, fast version of a new training algorithm proposed by a researcher

Anthropic offers competitive compensation and benefits, optional equity donation matching, generous vacation and parental leave, flexible working hours, and a lovely office space in which to collaborate with colleagues.

Responsibilities For Machine Learning Systems Engineer, RL Engineering

Build, maintain, and improve algorithms and systems for training AI models
Improve performance, robustness, and usability of training systems
Support and empower the research team in building beneficial AI systems
Implement and improve advanced machine learning techniques
Optimize reinforcement learning pipelines
Develop systems for detecting problems in the training pipeline
Adapt finetuning systems for new model architectures
Diagnose and fix performance issues in training runs
Implement new training algorithms proposed by researchers

Requirements For Machine Learning Systems Engineer, RL Engineering

2+ years of software engineering experience
Experience with high performance, large scale distributed systems
Knowledge of Kubernetes
Python programming skills
Experience implementing LLM finetuning algorithms, such as RLHF

Benefits For Machine Learning Systems Engineer, RL Engineering

Health insurance
Dental insurance
Vision insurance
401(k) with 4% matching
22 weeks of paid parental leave
Unlimited PTO
Education stipend
Home office improvement stipend
Commuting stipend
Wellness stipend
Fertility benefits
Daily lunches and snacks in office
Relocation support
