Research Engineer - Reinforcement Learning Fundamentals

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$250,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI

Description For Research Engineer - Reinforcement Learning Fundamentals

Anthropic is seeking a Research Engineer for its Reinforcement Learning Fundamentals team. The role involves collaborating with researchers and engineers to advance large language models through reinforcement learning research. Key responsibilities include developing novel RL techniques, creating tools and environments for complex tasks, and enhancing reasoning capabilities in areas such as code generation and mathematics. The ideal candidate has 5+ years of industry experience, proficiency in Python and deep learning frameworks, strong software engineering skills, and a passion for AI safety. The role offers competitive compensation, including a salary range of £250,000 - £340,000 GBP, equity, and comprehensive benefits. Anthropic values diversity and encourages applications from all backgrounds. The company operates on a hybrid work model, with at least 25% office presence required, and offers visa sponsorship. Anthropic is committed to big science AI research, working as a cohesive team on large-scale efforts to create steerable, trustworthy AI systems.

Last updated 2 months ago

Responsibilities For Research Engineer - Reinforcement Learning Fundamentals

  • Develop and implement novel reinforcement learning techniques
  • Create tools and environments for models to perform complex tasks
  • Design and run experiments to enhance models' reasoning capabilities
  • Collaborate with researchers and engineers

Requirements For Research Engineer - Reinforcement Learning Fundamentals

Python
Kubernetes
  • 5+ years of industry-related experience
  • Proficiency in Python
  • Experience with deep learning frameworks (PyTorch or JAX)
  • Strong software engineering background
  • Passion for AI safety and beneficial systems

Benefits For Research Engineer - Reinforcement Learning Fundamentals

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
Commuter Benefits
Equity
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
  • Equity donation matching

Jobs Related To Anthropic Research Engineer - Reinforcement Learning Fundamentals

Trust and Safety Machine Learning Engineer

Senior ML Engineer role at Anthropic focusing on AI safety and trust mechanisms, offering competitive compensation and hybrid work in San Francisco.

Machine Learning Research Engineer

Senior ML Research Engineer role at Anthropic focusing on developing safe and reliable AI systems, including large language models and multimodal capabilities.

Research Scientist/Engineer - AI Safety (Biosecurity)

Join Anthropic's team to research and mitigate extreme risks from future AI models, focusing on biosecurity.

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models.