Research Engineer, Frontier Red Team

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research, engineering, and policy expertise.
$280,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
101 - 500 Employees
5+ years of experience
AI · Cybersecurity

Description For Research Engineer, Frontier Red Team

Anthropic is seeking a Research Engineer for their Frontier Red Team to develop and implement "gold standard" evaluations for catastrophic risks in AI systems. This role is crucial for implementing the company's Responsible Scaling Policy (RSP) and ensuring the safe deployment of frontier AI models. The position involves creating evaluation systems for some of the most capable AI systems ever built, collaborating across multiple domains including biosecurity, cybersecurity, and national security. The ideal candidate will combine strong engineering skills with a dedication to AI safety, working to build and scale novel evaluation infrastructure that could become industry standards. The role offers competitive compensation ($280,000-$340,000), hybrid work arrangements in San Francisco, and comprehensive benefits. Anthropic operates as a public benefit corporation, focusing on big science approaches to AI research with a collaborative, impact-driven culture. The company values diverse perspectives and encourages applications from candidates who might not meet every qualification but are passionate about contributing to safe and beneficial AI development.

Last updated 3 months ago

Responsibilities For Research Engineer, Frontier Red Team

  • Design and implement robust evaluation infrastructure to measure model capabilities and risks across multiple domains
  • Lead technical projects to build and scale evaluation systems
  • Collaborate with domain experts to translate insights into concrete evaluation frameworks
  • Build sandboxed testing environments and automated pipelines for continuous model assessment
  • Work closely with researchers to rapidly prototype and iterate on new evaluation approaches
  • Partner with cross-functional teams to advance Anthropic's safety mission
  • Contribute to Capability Reports that inform critical deployment decisions

Requirements For Research Engineer, Frontier Red Team

Python
  • Experience leading and conducting fast, iterative experiments with frontier AI models
  • Experience designing or implementing evaluations involving LLM sampling and prompting
  • Strong software engineering skills with extensive Python experience
  • Experience working with distributed systems
  • Ability to write clean, well-structured code
  • Strong interest in AI safety and responsible development
  • Self-starter mentality and comfort in fast-paced environments
  • Ability to balance urgency with careful implementation

Benefits For Research Engineer, Frontier Red Team

Visa Sponsorship
Parental Leave
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space in San Francisco

Interested in this job?

Jobs Related To Anthropic Research Engineer, Frontier Red Team

Software Engineer

Senior Software Engineering role at Anthropic focusing on building and scaling ML systems, offering $280-485K salary with hybrid work model in SF, NYC, or Seattle.

Research Engineer

Senior Research Engineer position at Anthropic focusing on redesigning how AI systems interact with external data sources through innovative information architecture and LLM training.

Machine Learning Systems Engineer

Senior Machine Learning Systems Engineer role at Anthropic, building evaluation infrastructure and research inference systems for AI development.

Machine Learning Systems Engineer

Senior Machine Learning Systems Engineer role at Anthropic, building evaluation infrastructure and research inference systems for AI development.

ML Systems Engineer

ML Systems Engineer role at Anthropic focusing on building and improving AI model training systems and infrastructure.