ML Engineering Manager - Trust & Safety

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research and engineering.

San Francisco, CA, USA

$340,000 - $425,000

Machine Learning

Staff Software Engineer

Hybrid

501 - 1,000 Employees

5+ years of experience

Description For ML Engineering Manager - Trust & Safety

Anthropic, a pioneering AI research company focused on creating reliable and safe AI systems, is seeking an ML Engineering Manager to lead their Trust & Safety organization. This role represents a unique opportunity to shape the future of AI safety and responsible deployment.

The position combines strategic leadership with hands-on technical expertise in machine learning, requiring a deep understanding of both AI safety research and trust & safety best practices. As the ML Engineering Manager, you'll lead a team developing AI-driven detection models and implementing practical safety measurements to protect and enhance Anthropic's AI services.

The role demands 5+ years of management experience in ML-focused environments and extensive experience in trust & safety or anti-fraud engineering. You'll work at the intersection of policy and technology, translating complex ML capabilities into effective protective measures while ensuring Anthropic's products remain both safe and accessible.

Anthropic offers a competitive compensation package ranging from $340,000 to $425,000 USD, along with benefits including flexible working hours, generous vacation and parental leave, and visa sponsorship opportunities. The position is based in San Francisco with a hybrid work arrangement requiring at least 25% office presence.

The company operates as a public benefit corporation and values diversity and inclusion, encouraging applications from candidates of all backgrounds. They approach AI research as an empirical science, working as a cohesive team on large-scale research efforts rather than smaller, specific puzzles.

This role is perfect for someone passionate about ensuring the responsible development of AI systems, with strong leadership abilities and excellent communication skills. You'll have the opportunity to work on cutting-edge AI technology while contributing to the important mission of making AI systems safe and beneficial for society.

The collaborative environment at Anthropic, combined with their focus on high-impact research and development, makes this an exceptional opportunity for someone looking to make a significant contribution to the field of AI safety and trust & safety engineering.

Last updated a month ago

Responsibilities For ML Engineering Manager - Trust & Safety

Set team vision and roadmap to detect and prevent harmful usage of Anthropic's AI services
Lead a team of ML and software engineers to translate AI capabilities into safety mechanisms
Partner with T&S Product, Policy, and Enforcement teams to identify risk vectors
Maintain understanding of AI safety research and trust & safety best practices
Drive collaborations between research and policy teams
Hire, support, and develop team members through feedback and career coaching

Requirements For ML Engineering Manager - Trust & Safety

5+ years of management experience in technical ML-focused environment
5+ years of experience in trust & safety or anti-fraud/risk engineering
Deep experience with techniques for detecting harmful content and platform misuse
Demonstrated ability to lead and manage high-performing technical teams
Excellent communication skills in translating complex technical concepts
Strong project management skills
Bachelor's degree in a related field or equivalent experience

Benefits For ML Engineering Manager - Trust & Safety

Visa Sponsorship

Parental Leave

Competitive compensation and benefits
Optional equity donation matching
Generous vacation and parental leave
Flexible working hours
Office space for collaboration

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development through research and engineering.

San Francisco, CA, USA

$340,000 - $425,000

Machine Learning

Staff Software Engineer

Hybrid

501 - 1,000 Employees

5+ years of experience

Interested in this job?

Jobs Related To Anthropic ML Engineering Manager - Trust & Safety

Research Scientist/Engineer - Finetuning Alignment

Anthropic

Research Scientist/Engineer position at Anthropic focusing on developing truthful and reliable AI systems through advanced finetuning and alignment techniques.

Research Scientist/Engineer - Finetuning Alignment

Anthropic

Research Scientist/Engineer position at Anthropic focusing on developing truthful and reliable AI systems through advanced finetuning and alignment techniques.

Developer Relations Lead

Anthropic

Lead Developer Relations at Anthropic, shaping how developers experience and build with Claude AI through technical programs, events, and community engagement.

Interpretability Research Engineer

Anthropic

Senior research engineering role at Anthropic focusing on AI interpretability and safety, offering competitive compensation and the opportunity to work on cutting-edge AI systems.

Developer Relations Lead

Anthropic

Lead Developer Relations at Anthropic, shaping how developers experience and build with Claude AI through technical programs, events, and community engagement.