Trust and Safety Software Engineer

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$240,000 - $325,000
Security
Mid-Level Software Engineer
Hybrid
3+ years of experience
AI · Cybersecurity

Description For Trust and Safety Software Engineer

Anthropic is seeking a Trust and Safety Software Engineer to help build safety and oversight mechanisms for its AI systems. The role focuses on developing monitoring systems, abuse detection mechanisms, and multi-layered defenses to ensure the safe and ethical use of AI models. You'll work on detecting unwanted behaviors, preventing misuse, and protecting user well-being while enforcing terms of service and acceptable use policies.

Key responsibilities include:

  • Developing monitoring systems for API partners
  • Building abuse detection infrastructure
  • Surfacing abuse patterns to research teams
  • Implementing real-time safety mechanisms at scale
  • Analyzing user reports of inappropriate content

The ideal candidate has:

  • A Bachelor's degree in Computer Science or equivalent experience
  • 3-8+ years of software engineering experience, preferably in integrity or abuse detection
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills

Anthropic offers a competitive compensation package including salary, equity, and comprehensive benefits. The company provides a collaborative work environment focused on high-impact AI research and development, values diversity, and encourages applications from underrepresented groups.

Join Anthropic in its mission to create safe and beneficial AI systems that can positively impact society as a whole.

Last updated 2 months ago

Responsibilities For Trust and Safety Software Engineer

  • Develop monitoring systems to detect unwanted behaviors from API partners
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to research teams
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms
  • Analyze user reports of inappropriate content or accounts

Requirements For Trust and Safety Software Engineer

  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 3-8+ years of experience in a software engineering position
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills

Benefits For Trust and Safety Software Engineer

  • Equity donation matching
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
