Trust and Safety Software Engineer

Anthropic creates reliable, interpretable, and steerable AI systems, focusing on safe and beneficial AI development for users and society.
$304,800 - $412,750
Security
Mid-Level Software Engineer
Hybrid
3+ years of experience
AI · Cybersecurity

Description For Trust and Safety Software Engineer

Anthropic is seeking a Trust and Safety Software Engineer to join their mission of creating reliable, interpretable, and steerable AI systems. This role is crucial for building safety and oversight mechanisms for AI systems, focusing on monitoring models, preventing misuse, and ensuring user well-being. The position combines technical expertise with safety-conscious development, requiring skills in abuse detection, monitoring systems, and defensive infrastructure.

The ideal candidate will have 3-8+ years of software engineering experience, particularly in integrity, spam, fraud, or abuse detection. They'll work on developing sophisticated monitoring systems, building abuse detection mechanisms, and creating robust multi-layered defenses that operate at scale. The role requires proficiency in SQL, Python, and data analysis tools, along with strong communication skills.

Anthropic offers a collaborative environment where team members work together on large-scale research efforts, focusing on impactful AI development. The company values empirical science approaches and maintains a strong emphasis on communication and research discussions. They provide competitive compensation (£240,000 - £325,000 GBP), flexible working arrangements, and various benefits including visa sponsorship.

Located in San Francisco, Anthropic operates as a public benefit corporation, demonstrating their commitment to societal benefit. They encourage applications from diverse candidates, even those who might not meet every qualification, recognizing the importance of varied perspectives in addressing the social and ethical implications of AI systems. The hybrid work environment requires at least 25% office presence, fostering both flexibility and in-person collaboration.

Last updated 2 months ago

Responsibilities For Trust and Safety Software Engineer

  • Develop monitoring systems to detect unwanted behaviors from API partners and implement automated enforcement actions
  • Build abuse detection mechanisms and infrastructure
  • Surface abuse patterns to research teams to harden models at the training stage
  • Build robust multi-layered defenses for real-time improvement of safety mechanisms at scale
  • Analyze user reports of inappropriate content or accounts

Requirements For Trust and Safety Software Engineer

Python
  • Bachelor's degree in Computer Science, Software Engineering or comparable experience
  • 3-8+ years of experience in software engineering, preferably in integrity, spam, fraud, or abuse detection
  • Proficiency in SQL, Python, and data analysis tools
  • Strong communication skills and ability to explain complex technical concepts
  • Experience with trust and safety mechanisms for AI/ML systems (preferred)
  • Experience with machine learning frameworks (preferred)
  • Experience with prompt engineering and adversarial inputs (preferred)

Benefits For Trust and Safety Software Engineer

Visa Sponsorship
Equity
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration

Interested in this job?

Jobs Related To Anthropic Trust and Safety Software Engineer

Trust and Safety Software Engineer

Trust and Safety Software Engineer role at Anthropic focusing on building safety and oversight mechanisms for AI systems.

Trust and Safety Software Engineer

Trust and Safety Software Engineer at Anthropic: Build safety mechanisms for AI systems

Security Operations Engineer

Security Operations Engineer position at Axon focusing on cloud security, incident response, and security tooling development.

Security Engineer

Security Engineer position at DoorDash focusing on corporate security, zero-trust architecture, and endpoint security, requiring 3+ years of experience.

Security Operations Engineer

Security Operations Engineer position at Axon focusing on cloud security, incident response, and security tooling development.