Trust and Safety Software Engineer

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$240,000 - $325,000
Security
Mid-Level Software Engineer
Hybrid
3+ years of experience
AI · Cybersecurity

Description For Trust and Safety Software Engineer

Anthropic is seeking a Trust and Safety Software Engineer to help build safety and oversight mechanisms for its AI systems. This role focuses on developing monitoring systems, abuse detection mechanisms, and multi-layered defenses to ensure the safe and ethical use of AI models. You'll work on detecting unwanted behaviors, preventing misuse, and ensuring user well-being while enforcing terms of service and acceptable use policies.

Key responsibilities include:

  • Developing monitoring systems for API partners
  • Building abuse detection infrastructure
  • Surfacing abuse patterns to research teams
  • Implementing real-time safety mechanisms at scale
  • Analyzing user reports of inappropriate content

The ideal candidate has:

  • A Bachelor's degree in Computer Science or equivalent experience
  • 3-8+ years of software engineering experience, preferably in integrity or abuse detection
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills

Anthropic offers a competitive compensation package including salary, equity, and comprehensive benefits. The company provides a collaborative work environment focused on high-impact AI research and development, values diversity, and encourages applications from underrepresented groups.

Join Anthropic in its mission to create safe and beneficial AI systems that can positively impact society as a whole.

Last updated 4 months ago

Responsibilities For Trust and Safety Software Engineer

  • Develop monitoring systems to detect unwanted behaviors from API partners
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to research teams
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms
  • Analyze user reports of inappropriate content or accounts

Requirements For Trust and Safety Software Engineer

  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 3-8+ years of experience in a software engineering position
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills

Benefits For Trust and Safety Software Engineer

  • Equity donation matching
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
