Research Engineer - Reinforcement Learning Fundamentals

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$250,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Software Engineer

Senior Software Engineering role at Anthropic focusing on building and scaling ML systems, offering $280-485K salary with hybrid work model in SF, NYC, or Seattle.

Research Engineer

Senior Research Engineer position at Anthropic focusing on redesigning how AI systems interact with external data sources through innovative information architecture and LLM training.

Machine Learning Systems Engineer

Senior Machine Learning Systems Engineer role at Anthropic, building evaluation infrastructure and research inference systems for AI development.

Machine Learning Systems Engineer

Senior Machine Learning Systems Engineer role at Anthropic, building evaluation infrastructure and research inference systems for AI development.

ML Systems Engineer

ML Systems Engineer role at Anthropic focusing on building and improving AI model training systems and infrastructure.

Description For Research Engineer - Reinforcement Learning Fundamentals

Anthropic is seeking a Research Engineer for their Reinforcement Learning Fundamentals team. This role involves collaborating with researchers and engineers to advance large language models through reinforcement learning research. Key responsibilities include developing novel RL techniques, creating tools for complex tasks, and enhancing reasoning capabilities in areas like code generation and mathematics. The ideal candidate has 5+ years of industry experience, proficiency in Python and deep learning frameworks, strong software engineering skills, and a passion for AI safety. The role offers competitive compensation, including a salary range of £250,000 - £340,000 GBP, equity, and comprehensive benefits. Anthropic values diversity and encourages applications from all backgrounds. The company operates on a hybrid work model, with at least 25% office presence required, and offers visa sponsorship. Anthropic is committed to big science AI research, working as a cohesive team on large-scale efforts to create steerable, trustworthy AI systems.

Last updated 6 months ago

Responsibilities For Research Engineer - Reinforcement Learning Fundamentals

  • Develop and implement novel reinforcement learning techniques
  • Create tools and environments for models to perform complex tasks
  • Design and run experiments to enhance models' reasoning capabilities
  • Collaborate with researchers and engineers

Requirements For Research Engineer - Reinforcement Learning Fundamentals

Python
Kubernetes
  • 5+ years of industry-related experience
  • Proficiency in Python
  • Experience with deep learning frameworks (PyTorch or Jax)
  • Strong software engineering background
  • Passion for AI safety and beneficial systems

Benefits For Research Engineer - Reinforcement Learning Fundamentals

Medical Insurance
Dental Insurance
Vision Insurance
401k
Parental Leave
Education Budget
Commuter Benefits
Equity
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
  • Equity donation matching

Interested in this job?