OpenAI is seeking a Research Infrastructure Engineer to join its post-training team, which focuses on transforming large pre-trained models into user-friendly chatbots such as ChatGPT. The role demands deep technical expertise in ML systems optimization and distributed systems. Based in San Francisco with a hybrid work model (three days per week in office), the position offers a competitive salary range of $310K-$460K plus equity and comprehensive benefits.
The role involves working across the entire technology stack, from optimizing low-level ML systems to managing job orchestration and data evaluation. You'll be responsible for building cutting-edge infrastructure and tools fundamental to ChatGPT's post-training phase. The team collaborates closely with research groups, creating systems that push the boundaries of what's possible with ChatGPT.
Key responsibilities include ensuring smooth operation of ChatGPT training systems, debugging complex ML codebases, building data management tools, and creating reusable Python libraries. You'll work on projects like profiling large model reinforcement learning training, identifying experiment failures, and redesigning data pipelines for multimodal data.
The ideal candidate has experience with Python, Kubernetes, distributed infrastructure, GPUs, and large-scale data systems; knowledge of reinforcement learning and transformers is essential. While research experience isn't mandatory, experience collaborating with ML researchers in an applied setting is highly valued.
OpenAI offers an exceptional benefits package including medical/dental/vision insurance, mental health support, 401(k) matching, generous parental leave, and learning stipends. The company is committed to diversity and equality, and to ensuring that AI benefits all of humanity. This is an opportunity to shape the future of AI technology while working with cutting-edge systems and brilliant minds in the field.