Anthropic is at the forefront of developing safe and beneficial AI systems, with a mission focused on creating reliable, interpretable, and steerable AI. As a Trust and Safety Machine Learning Engineer, you'll play a crucial role in building safety and oversight mechanisms for AI systems. The position combines technical ML expertise with a strong focus on ethical considerations and user well-being.
The role involves developing and implementing machine learning models to detect harmful behaviors and enforce compliance with the terms of service. You'll work with cutting-edge AI technology while focusing on trust and safety applications, including behavioral classifiers and anomaly detection systems. The position requires both technical excellence and strong communication skills to bridge the gap between technical and non-technical stakeholders.
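For a sense of what the anomaly-detection side of such work can look like, here is a minimal, hypothetical sketch that flags unusual per-account usage patterns with scikit-learn's IsolationForest. The feature names, values, and thresholds are illustrative assumptions, not a description of Anthropic's actual systems.

```python
# Illustrative sketch only: a toy anomaly detector over hypothetical
# per-account usage features (requests per hour, mean prompt length,
# and the fraction of requests flagged by an upstream classifier).
import numpy as np
from sklearn.ensemble import IsolationForest

rng = np.random.default_rng(seed=0)

# Simulated "typical" accounts and a handful of outlier accounts.
normal_usage = rng.normal(loc=[50, 400, 0.01], scale=[10, 80, 0.005], size=(1000, 3))
suspicious_usage = rng.normal(loc=[500, 60, 0.30], scale=[50, 20, 0.05], size=(10, 3))

# Fit the detector on typical behavior; contamination is the assumed
# fraction of anomalies expected in production traffic.
detector = IsolationForest(contamination=0.01, random_state=0)
detector.fit(normal_usage)

# predict() returns 1 for inliers and -1 for anomalies.
labels = detector.predict(suspicious_usage)
print(f"Flagged {np.sum(labels == -1)} of {len(suspicious_usage)} accounts for review")
```

In practice such a detector would be one signal among several, feeding human review queues rather than making enforcement decisions on its own.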
Anthropic operates as a public benefit corporation, emphasizing AI safety and ethical considerations in its work. The company approaches AI research as an empirical science, much like physics and biology, and values collaborative, large-scale research efforts over smaller, isolated projects. Its team's prior research contributions include work on GPT-3, Circuit-Based Interpretability, Multimodal Neurons, and other influential AI research.
The company offers a competitive salary ranging from $340,000 to $425,000 USD, along with comprehensive benefits including visa sponsorship, flexible working arrangements, and generous leave policies. The hybrid work arrangement requires at least 25% office presence in San Francisco, fostering a collaborative atmosphere while preserving flexibility. This is an excellent opportunity for experienced ML engineers who want to make a meaningful impact on the safe development of AI technology.