Trust and Safety Software Engineer

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.
$240,000 - $325,000
Security
Mid-Level Software Engineer
Hybrid
3+ years of experience
AI · Cybersecurity

Description For Trust and Safety Software Engineer

Anthropic is seeking a Trust and Safety Software Engineer to help build safety and oversight mechanisms for its AI systems. This role focuses on developing monitoring systems, abuse detection mechanisms, and multi-layered defenses to ensure the safe and ethical use of AI models. You'll work on detecting unwanted behaviors, preventing misuse, and ensuring user well-being while enforcing terms of service and acceptable use policies.

Key responsibilities include:

  • Developing monitoring systems for API partners
  • Building abuse detection infrastructure
  • Surfacing abuse patterns to research teams
  • Implementing real-time safety mechanisms at scale
  • Analyzing user reports of inappropriate content

The ideal candidate has:

  • A Bachelor's degree in Computer Science or equivalent experience
  • 3-8+ years of software engineering experience, preferably in integrity or abuse detection
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills

Anthropic offers a competitive compensation package including salary, equity, and comprehensive benefits. They provide a collaborative work environment, focusing on high-impact AI research and development. The company values diversity and encourages applications from underrepresented groups.

Join Anthropic in their mission to create safe and beneficial AI systems that can positively impact society as a whole.

Responsibilities For Trust and Safety Software Engineer

  • Develop monitoring systems to detect unwanted behaviors from API partners
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to research teams
  • Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms
  • Analyze user reports of inappropriate content or accounts

Requirements For Trust and Safety Software Engineer

  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 3-8+ years of experience in a software engineering position
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills

Benefits For Trust and Safety Software Engineer

  • Equity donation matching
  • Health insurance
  • Dental insurance
  • Vision insurance
  • 401(k) with 4% matching
  • 22 weeks paid parental leave
  • Unlimited PTO
  • Education stipend
  • Home office improvement stipend
  • Commuting stipend
  • Wellness stipend
  • Fertility benefits
  • Daily lunches and snacks
  • Relocation support
