Research Scientist/Engineer - Alignment Finetuning

Anthropic is an AI research company focused on creating reliable, interpretable, and steerable AI systems for safe and beneficial use.

Salary: $280,000 - $425,000
Category: Machine Learning
Level: Senior Software Engineer
Workplace: Hybrid
Company size: 101-500 employees
Experience: 5+ years
Industry: AI

Description For Research Scientist/Engineer - Alignment Finetuning

Anthropic is seeking a Research Scientist/Engineer to join its Alignment Finetuning team, which focuses on developing AI systems that are safe and beneficial for society. The role combines cutting-edge ML research with practical implementation: training language models to demonstrate better moral reasoning, improved honesty, and good character.

The position offers an opportunity to work with a collaborative team of researchers, engineers, and policy experts in a company that approaches AI research as an empirical science. You'll be responsible for developing novel finetuning techniques, implementing advanced training pipelines, and creating evaluation frameworks to measure and improve model alignment properties.

The ideal candidate will have an MS or PhD in Computer Science or ML (or equivalent experience), strong programming skills (particularly in Python), and experience with ML model training and experimentation. The role requires both technical expertise and the ability to collaborate effectively across teams.

Anthropic offers competitive compensation ($280,000 - $425,000), flexible working arrangements, and a strong benefits package. The company values diversity and encourages applications from candidates with varied backgrounds and perspectives. As a public benefit corporation, Anthropic is committed to developing AI systems that are reliable, interpretable, and beneficial for society.

Working at Anthropic means joining a cohesive team focused on large-scale research efforts rather than smaller, more specific puzzles. The company's research builds on significant prior work, including GPT-3, Circuit-Based Interpretability, and AI safety, making this an excellent opportunity for those interested in advancing the field of beneficial AI.


Responsibilities For Research Scientist/Engineer - Alignment Finetuning

  • Develop and implement novel finetuning techniques using synthetic data generation and advanced training pipelines
  • Train models to have better alignment properties including honesty, character, and harmlessness
  • Create and maintain evaluation frameworks to measure alignment properties in models
  • Collaborate across teams to integrate alignment improvements into production models
  • Develop processes to help automate and scale the work of the team

Requirements For Research Scientist/Engineer - Alignment Finetuning

  • MS/PhD in Computer Science, ML, or related field, or equivalent experience
  • Strong programming skills, especially in Python
  • Experience with ML model training and experimentation
  • Track record of implementing ML research
  • Strong analytical skills for interpreting experimental results
  • Experience with ML metrics and evaluation frameworks
  • Excellence at turning research ideas into working code
  • Ability to identify and resolve practical implementation challenges

Benefits For Research Scientist/Engineer - Alignment Finetuning

  • Visa sponsorship
  • Equity
  • Competitive compensation and benefits
  • Optional equity donation matching
  • Generous vacation and parental leave
  • Flexible working hours
  • Office space for collaboration


Jobs Related To Anthropic Research Scientist/Engineer - Alignment Finetuning

Software Engineer

Senior Software Engineer role at Anthropic focusing on building large-scale ML systems with an emphasis on safety and reliability.

Biosecurity Research Engineer

Senior Machine Learning Engineer role focused on AI safety and biosecurity research at Anthropic.

Safeguards Research Engineer

Senior AI Safety Research Engineer position at Anthropic, focusing on developing and implementing safety measures for advanced AI systems.

Research Engineer, Frontier Red Team

Senior Research Engineer position at Anthropic focusing on AI safety evaluation and implementation of responsible scaling policies for frontier AI models.

Research Engineer, Frontier Red Team

Senior Research Engineer position at Anthropic focusing on AI safety evaluation and risk assessment for frontier AI models.