Research Engineer, Frontier Red Team

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research, engineering, and policy expertise.
$280,000 - $340,000
Machine Learning
Senior Software Engineer
Hybrid
101 - 500 Employees
5+ years of experience
AI · Cybersecurity

Description For Research Engineer, Frontier Red Team

Anthropic is seeking a Research Engineer for their Frontier Red Team to develop and implement "gold standard" evaluations for catastrophic risks in AI systems. This role is crucial for implementing the company's Responsible Scaling Policy (RSP) and ensuring the safe deployment of frontier AI models. The position involves creating evaluation systems for some of the most capable AI systems ever built, collaborating across multiple domains including biosecurity, cybersecurity, and national security. The ideal candidate will combine strong engineering skills with a dedication to AI safety, working to build and scale novel evaluation infrastructure that could become industry standards. The role offers competitive compensation ($280,000-$340,000), hybrid work arrangements in San Francisco, and comprehensive benefits. Anthropic operates as a public benefit corporation, focusing on big science approaches to AI research with a collaborative, impact-driven culture. The company values diverse perspectives and encourages applications from candidates who might not meet every qualification but are passionate about contributing to safe and beneficial AI development.

Last updated 2 minutes ago

Responsibilities For Research Engineer, Frontier Red Team

  • Design and implement robust evaluation infrastructure to measure model capabilities and risks across multiple domains
  • Lead technical projects to build and scale evaluation systems
  • Collaborate with domain experts to translate insights into concrete evaluation frameworks
  • Build sandboxed testing environments and automated pipelines for continuous model assessment
  • Work closely with researchers to rapidly prototype and iterate on new evaluation approaches
  • Partner with cross-functional teams to advance Anthropic's safety mission
  • Contribute to Capability Reports that inform critical deployment decisions

Requirements For Research Engineer, Frontier Red Team

Python
  • Experience leading and conducting fast, iterative experiments with frontier AI models
  • Experience designing or implementing evaluations involving LLM sampling and prompting
  • Strong software engineering skills with extensive Python experience
  • Experience working with distributed systems
  • Ability to write clean, well-structured code
  • Strong interest in AI safety and responsible development
  • Self-starter mentality and comfort in fast-paced environments
  • Ability to balance urgency with careful implementation

Benefits For Research Engineer, Frontier Red Team

Visa Sponsorship
Parental Leave
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space in San Francisco

Interested in this job?

Jobs Related To Anthropic Research Engineer, Frontier Red Team

Research Engineer, Frontier Red Team

Senior Research Engineer position at Anthropic focusing on AI safety evaluation and risk assessment for frontier AI models.

Software Engineer, Model Context Protocol

Senior Software Engineer position at Anthropic focusing on Model Context Protocol development with competitive compensation range of $320K-$560K.

Software Engineer - Anthropic Labs

Software Engineer role at Anthropic Labs focusing on prototyping and evaluating emerging AI capabilities.

Software Engineer

Senior Software Engineer role at Anthropic focusing on building large-scale ML systems with emphasis on safety and reliability.

Trust and Safety Machine Learning Engineer

Senior ML Engineer role at Anthropic focusing on AI safety and trust mechanisms, offering competitive compensation and hybrid work in San Francisco.