Anthropic is seeking a Trust and Safety Software Engineer to support its mission of creating reliable, interpretable, and steerable AI systems. The role centers on building safety and oversight mechanisms for AI systems: monitoring models, preventing misuse, and protecting user well-being. It combines technical expertise with safety-conscious development, calling for skills in abuse detection, monitoring systems, and defensive infrastructure.
The ideal candidate will have 3-8+ years of software engineering experience, particularly in integrity, spam, fraud, or abuse detection. They will develop sophisticated monitoring systems, build abuse detection mechanisms, and create robust, multi-layered defenses that operate at scale. The role requires proficiency in SQL, Python, and data analysis tools, along with strong communication skills.
Anthropic offers a collaborative environment where team members work together on large-scale research efforts focused on impactful AI development. The company values an empirical, science-driven approach and places strong emphasis on communication and research discussion. It provides competitive compensation (£240,000 - £325,000 GBP), flexible working arrangements, and benefits including visa sponsorship.
Headquartered in San Francisco, Anthropic operates as a public benefit corporation, reflecting its commitment to societal benefit. The company encourages applications from diverse candidates, including those who may not meet every qualification, recognizing the importance of varied perspectives in addressing the social and ethical implications of AI systems. The hybrid work environment requires at least 25% office presence, balancing flexibility with in-person collaboration.