Trust and Safety Software Engineer

Anthropic

Anthropic creates reliable, interpretable, and steerable AI systems for safe and beneficial use.

San Francisco, CA, USA • London, UK

$240,000 - $325,000

Security

Mid-Level Software Engineer

Hybrid

3+ years of experience

AI · Cybersecurity

Description For Trust and Safety Software Engineer

Anthropic is seeking a Trust and Safety Software Engineer to help build safety and oversight mechanisms for its AI systems. This role focuses on developing monitoring systems, abuse detection mechanisms, and multi-layered defenses to ensure the safe and ethical use of AI models. You'll work on detecting unwanted behaviors, preventing misuse, and ensuring user well-being while enforcing terms of service and acceptable use policies.

Key responsibilities include:

Developing monitoring systems for API partners
Building abuse detection infrastructure
Surfacing abuse patterns to research teams
Implementing real-time safety mechanisms at scale
Analyzing user reports of inappropriate content

The ideal candidate has:

A Bachelor's degree in Computer Science or equivalent experience
3-8+ years of software engineering experience, preferably in integrity or abuse detection
Proficiency in SQL, Python, and data analysis tools
Strong communication skills

Anthropic offers a competitive compensation package including salary, equity, and comprehensive benefits. It provides a collaborative work environment focused on high-impact AI research and development. The company values diversity and encourages applications from underrepresented groups.

Join Anthropic in its mission to create safe and beneficial AI systems that can positively impact society as a whole.

Last updated 4 months ago

Responsibilities For Trust and Safety Software Engineer

Develop monitoring systems to detect unwanted behaviors from API partners
Build abuse detection mechanisms and infrastructure
Surface abuse patterns to research teams
Build robust and reliable multi-layered defenses for real-time improvement of safety mechanisms
Analyze user reports of inappropriate content or accounts

Requirements For Trust and Safety Software Engineer

Bachelor's degree in Computer Science, Software Engineering or comparable experience
3-8+ years of experience in a software engineering position
Proficiency in SQL, Python, and data analysis tools
Strong communication skills

Benefits For Trust and Safety Software Engineer

401k
Dental Insurance
Education Budget
Equity
Medical Insurance
Parental Leave
Relocation Benefits
Vision Insurance

Equity donation matching
Health insurance
Dental insurance
Vision insurance
401(k) with 4% matching
22 weeks paid parental leave
Unlimited PTO
Education stipend
Home office improvement stipend
Commuting stipend
Wellness stipend
Fertility benefits
Daily lunches and snacks
Relocation support
