ML Engineering Manager - Trust & Safety

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research and engineering.
$340,000 - $425,000
Machine Learning
Staff Software Engineer
Hybrid
501 - 1,000 Employees
5+ years of experience
AI

Description For ML Engineering Manager - Trust & Safety

Anthropic, a pioneering AI research company focused on creating reliable and safe AI systems, is seeking an ML Engineering Manager to lead its Trust & Safety organization. This role represents a unique opportunity to shape the future of AI safety and responsible deployment.

The position combines strategic leadership with hands-on technical expertise in machine learning, requiring a deep understanding of both AI safety research and trust & safety best practices. As the ML Engineering Manager, you'll lead a team that develops AI-driven detection models and implements practical safety measures to protect and enhance Anthropic's AI services.

The role demands 5+ years of management experience in ML-focused environments and 5+ years of experience in trust & safety or anti-fraud/risk engineering. You'll work at the intersection of policy and technology, translating complex ML capabilities into effective protective measures while ensuring Anthropic's products remain both safe and accessible.

Anthropic offers a competitive compensation package ranging from $340,000 to $425,000 USD, along with benefits including flexible working hours, generous vacation and parental leave, and visa sponsorship opportunities. The position is based in San Francisco with a hybrid work arrangement requiring at least 25% office presence.

The company operates as a public benefit corporation and values diversity and inclusion, encouraging applications from candidates of all backgrounds. They approach AI research as an empirical science, working as a cohesive team on large-scale research efforts rather than smaller, specific puzzles.

This role is perfect for someone passionate about ensuring the responsible development of AI systems, with strong leadership abilities and excellent communication skills. You'll have the opportunity to work on cutting-edge AI technology while contributing to the important mission of making AI systems safe and beneficial for society.

The collaborative environment at Anthropic, combined with their focus on high-impact research and development, makes this an exceptional opportunity for someone looking to make a significant contribution to the field of AI safety and trust & safety engineering.

Last updated 9 days ago

Responsibilities For ML Engineering Manager - Trust & Safety

  • Set team vision and roadmap to detect and prevent harmful usage of Anthropic's AI services
  • Lead a team of ML and software engineers to translate AI capabilities into safety mechanisms
  • Partner with T&S Product, Policy, and Enforcement teams to identify risk vectors
  • Maintain understanding of AI safety research and trust & safety best practices
  • Drive collaborations between research and policy teams
  • Hire, support, and develop team members through feedback and career coaching

Requirements For ML Engineering Manager - Trust & Safety

  • 5+ years of management experience in a technical, ML-focused environment
  • 5+ years of experience in trust & safety or anti-fraud/risk engineering
  • Deep experience with techniques for detecting harmful content and platform misuse
  • Demonstrated ability to lead and manage high-performing technical teams
  • Excellent communication skills, including the ability to explain complex technical concepts clearly
  • Strong project management skills
  • Bachelor's degree in a related field or equivalent experience

Benefits For ML Engineering Manager - Trust & Safety

  • Visa sponsorship
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration

Jobs Related To Anthropic ML Engineering Manager - Trust & Safety

Developer Relations Lead

Lead Developer Relations at Anthropic, shaping how developers experience and build with Claude AI through technical programs, events, and community engagement.

Interpretability Research Engineer

Senior research engineering role at Anthropic focusing on AI interpretability and safety, offering competitive compensation and the opportunity to work on cutting-edge AI systems.

Staff Software Engineer, Interpretability

Staff Software Engineer position at Anthropic focusing on AI interpretability research and development of tools for understanding and improving AI safety.

Research Engineer

Research Engineer for Anthropic's Pretraining team, developing safe and ethical large language models.