Trust and Safety Machine Learning Engineer

AI research company focused on creating reliable, interpretable, and steerable AI systems for safe and beneficial use.
$340,000 - $425,000
Machine Learning
Senior Software Engineer
Hybrid
4+ years of experience
AI

Description For Trust and Safety Machine Learning Engineer

Anthropic is at the forefront of developing safe and beneficial AI systems, with a mission focused on creating reliable, interpretable, and steerable AI. As a Trust and Safety Machine Learning Engineer, you'll play a crucial role in building safety and oversight mechanisms for AI systems. The position combines technical ML expertise with a strong focus on ethical considerations and user well-being.

The role involves developing and implementing machine learning models for detecting harmful behaviors and ensuring compliance with terms of service. You'll work with cutting-edge AI technology while focusing on trust and safety applications, including behavioral classifiers and anomaly detection systems. The position requires both technical excellence and strong communication skills to bridge technical and non-technical stakeholders.

Anthropic operates as a public benefit corporation, emphasizing the importance of AI safety and ethical considerations in their work. The company approaches AI research as an empirical science, similar to physics and biology, and values collaborative work on large-scale research efforts over smaller, isolated projects. Their research portfolio includes significant work on GPT-3, Circuit-Based Interpretability, Multimodal Neurons, and other cutting-edge AI technologies.

The company offers a competitive compensation package starting from $340,000 to $425,000 USD, along with comprehensive benefits including visa sponsorship, flexible working arrangements, and generous leave policies. The hybrid work environment requires at least 25% office presence in San Francisco, fostering a collaborative atmosphere while maintaining flexibility. This is an excellent opportunity for experienced ML engineers who want to make a meaningful impact on the safe development of AI technology.

Last updated 4 months ago

Responsibilities For Trust and Safety Machine Learning Engineer

  • Build machine learning models to detect unwanted or anomalous behaviors from users and API partners
  • Integrate detection models into production system
  • Improve automated detection and enforcement systems
  • Analyze user reports of inappropriate accounts
  • Build machine learning models for proactive detection
  • Surface abuse patterns to research teams to harden models at training stage

Requirements For Trust and Safety Machine Learning Engineer

Python
PostgreSQL
  • 4+ years of experience in research/ML engineering or applied research scientist position
  • Proficiency in SQL, Python, and data analysis/data mining tools
  • Proficiency in building trust and safety AI/ML systems
  • Strong communication skills
  • Ability to explain complex technical concepts to non-technical stakeholders
  • Care about societal impacts and long-term implications of work

Benefits For Trust and Safety Machine Learning Engineer

Visa Sponsorship
Parental Leave
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration
  • Visa sponsorship available

Interested in this job?

Jobs Related To Anthropic Trust and Safety Machine Learning Engineer

Software Engineer

Senior Software Engineering role at Anthropic focusing on building and scaling ML systems, offering $280-485K salary with hybrid work model in SF, NYC, or Seattle.

Research Engineer

Senior Research Engineer position at Anthropic focusing on redesigning how AI systems interact with external data sources through innovative information architecture and LLM training.

Machine Learning Systems Engineer

Senior Machine Learning Systems Engineer role at Anthropic, building evaluation infrastructure and research inference systems for AI development.

Machine Learning Systems Engineer

Senior Machine Learning Systems Engineer role at Anthropic, building evaluation infrastructure and research inference systems for AI development.

ML Systems Engineer

ML Systems Engineer role at Anthropic focusing on building and improving AI model training systems and infrastructure.