Trust and Safety Machine Learning Engineer

AI research company focused on creating reliable, interpretable, and steerable AI systems for safe and beneficial use.
$340,000 - $425,000
Machine Learning
Senior Software Engineer
Hybrid
4+ years of experience
AI

Description For Trust and Safety Machine Learning Engineer

Anthropic is at the forefront of developing safe and beneficial AI systems, with a mission focused on creating reliable, interpretable, and steerable AI. As a Trust and Safety Machine Learning Engineer, you'll play a crucial role in building safety and oversight mechanisms for AI systems. The position combines technical ML expertise with a strong focus on ethical considerations and user well-being.

The role involves developing and implementing machine learning models that detect harmful behaviors and ensure compliance with the terms of service. You'll work with cutting-edge AI technology while focusing on trust and safety applications, including behavioral classifiers and anomaly detection systems. The position requires both technical excellence and strong communication skills to bridge the gap between technical and non-technical stakeholders.

Anthropic operates as a public benefit corporation, emphasizing the importance of AI safety and ethical considerations in its work. The company approaches AI research as an empirical science, similar to physics and biology, and values collaborative work on large-scale research efforts over smaller, isolated projects. Prior work by members of its team includes GPT-3, Circuit-Based Interpretability, Multimodal Neurons, and other cutting-edge AI research.

The company offers a competitive compensation package with a salary range of $340,000 to $425,000 USD, along with comprehensive benefits including visa sponsorship, flexible working arrangements, and generous leave policies. The hybrid work environment requires at least 25% office presence in San Francisco, fostering a collaborative atmosphere while maintaining flexibility. This is an excellent opportunity for experienced ML engineers who want to make a meaningful impact on the safe development of AI technology.

Responsibilities For Trust and Safety Machine Learning Engineer

  • Build machine learning models to detect unwanted or anomalous behaviors from users and API partners
  • Integrate detection models into production systems
  • Improve automated detection and enforcement systems
  • Analyze user reports of inappropriate accounts
  • Build machine learning models for proactive detection
  • Surface abuse patterns to research teams to harden models at training stage

Requirements For Trust and Safety Machine Learning Engineer

Python
PostgreSQL
  • 4+ years of experience in a research/ML engineering or applied research scientist position
  • Proficiency in SQL, Python, and data analysis/data mining tools
  • Proficiency in building trust and safety AI/ML systems
  • Strong communication skills
  • Ability to explain complex technical concepts to non-technical stakeholders
  • Care about the societal impacts and long-term implications of your work

Benefits For Trust and Safety Machine Learning Engineer

Visa Sponsorship
Parental Leave
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration
  • Visa sponsorship available

Jobs Related To Anthropic Trust and Safety Machine Learning Engineer

Machine Learning Research Engineer

Senior ML Research Engineer role at Anthropic focusing on developing safe and reliable AI systems, including large language models and multimodal capabilities.

Research Scientist/Engineer - AI Safety (Biosecurity)

Join Anthropic's team to research and mitigate extreme risks from future AI models, focusing on biosecurity.

Research Engineer - Reinforcement Learning Fundamentals

Research Engineer role at Anthropic focusing on reinforcement learning for large language models.

Software Engineer

Senior Software Engineer role at Anthropic, working on large-scale ML systems for safe and beneficial AI.