Research Engineer - Reinforcement Learning Fundamentals

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$250,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI

Description For Research Engineer - Reinforcement Learning Fundamentals

Anthropic is seeking a Research Engineer for their Reinforcement Learning Fundamentals team. This role involves collaborating with researchers and engineers to advance large language models through reinforcement learning research. Key responsibilities include developing novel RL techniques, creating tools for complex tasks, and enhancing reasoning capabilities in areas like code generation and mathematics. The ideal candidate has 5+ years of industry experience, proficiency in Python and deep learning frameworks, strong software engineering skills, and a passion for AI safety. The role offers competitive compensation, including a salary range of £250,000 - £340,000 GBP, equity, and comprehensive benefits. Anthropic values diversity and encourages applications from all backgrounds. The company operates on a hybrid work model, with at least 25% office presence required, and offers visa sponsorship. Anthropic is committed to big science AI research, working as a cohesive team on large-scale efforts to create steerable, trustworthy AI systems.

Last updated a month ago

Responsibilities For Research Engineer - Reinforcement Learning Fundamentals

  • Develop and implement novel reinforcement learning techniques
  • Create tools and environments for models to perform complex tasks
  • Design and run experiments to enhance models' reasoning capabilities
  • Collaborate with researchers and engineers

Requirements For Research Engineer - Reinforcement Learning Fundamentals

Python
Kubernetes
  • 5+ years of industry-related experience
  • Proficiency in Python
  • Experience with deep learning frameworks (PyTorch or Jax)
  • Strong software engineering background
  • Passion for AI safety and beneficial systems

Benefits For Research Engineer - Reinforcement Learning Fundamentals

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
Commuter Benefits
Equity
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
  • Equity donation matching

Interested in this job?

Jobs Related To Anthropic Research Engineer - Reinforcement Learning Fundamentals

Research Scientist/Engineer - AI Safety (Biosecurity)

Join Anthropic's team to research and mitigate extreme risks from future AI models, focusing on biosecurity.

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models.

Software Engineer

Senior Software Engineer role at Anthropic, working on large-scale ML systems for safe and beneficial AI.

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models.