Anthropic is seeking a Trust and Safety Software Engineer to help build safety and oversight mechanisms for their AI systems. This role focuses on developing monitoring systems, abuse detection mechanisms, and multi-layered defenses to ensure the safe and ethical use of AI models. You'll work on detecting unwanted behaviors, preventing misuse, and ensuring user well-being while enforcing terms of service and acceptable use policies.
Key responsibilities include:
The ideal candidate has:
Anthropic offers a competitive compensation package including salary, equity, and comprehensive benefits. They provide a collaborative work environment, focusing on high-impact AI research and development. The company values diversity and encourages applications from underrepresented groups.
Join Anthropic in their mission to create safe and beneficial AI systems that can positively impact society as a whole.