Principal Engineer, GPU Platform

OpenAI

AI research and deployment company dedicated to ensuring general-purpose artificial intelligence benefits all of humanity.

San Francisco, CA, USA

$405,000 - $590,000

Cloud

Principal Software Engineer

Hybrid

1,000 - 5,000 Employees

10+ years of experience

Description For Principal Engineer, GPU Platform

OpenAI is seeking a Principal Engineer for its GPU Platform team within the Applied AI Infrastructure division in San Francisco. This role is crucial for running the infrastructure that powers ChatGPT and the API, focusing on inference Kubernetes clusters, GPU health, InfiniBand performance, and node lifecycle management. The position offers a competitive salary range of $405K-$590K plus equity, with a hybrid work model requiring 3 days in office.

The ideal candidate will have extensive experience (10+ years) in building core infrastructure, particularly with GPU clusters and Kubernetes at scale. They'll be responsible for designing and scaling inference infrastructure, ensuring system reliability, and participating in on-call rotations for critical incidents. The role demands someone who can thrive in ambiguous situations and adapt to rapid changes while maintaining high standards for system scalability and security.

OpenAI emphasizes safety over unfettered growth and is committed to ensuring AI benefits humanity. The company offers a comprehensive benefits package including relocation assistance, health benefits, and equity. They maintain a strong commitment to diversity, equity, and inclusion, seeking candidates who can contribute to a culture that challenges groupthink while making everyone feel welcome.

This is an exceptional opportunity for a senior technical leader to shape the future of AI infrastructure at one of the world's leading AI companies. The role combines technical excellence with the chance to work on cutting-edge AI systems that have real-world impact. The position requires being based in San Francisco, with the company providing relocation support for new employees.

Last updated 4 days ago

Responsibilities For Principal Engineer, GPU Platform

Design and build the inference infrastructure that powers our products, enabling reliability and performance
Ensure our infrastructure can scale to the next order of magnitude
Help create a diverse, equitable, and inclusive culture
Participate in on-call rotation to respond to critical incidents as needed

Requirements For Principal Engineer, GPU Platform

Kubernetes

Linux

10+ years building core infrastructure
Experience running GPU clusters at scale
Experience operating orchestration systems such as Kubernetes at scale
Ability to build and operate scalable, reliable, secure systems
Comfortable with ambiguity and rapid change

Benefits For Principal Engineer, GPU Platform

Medical Insurance

Relocation Benefits

Equity

Equity
Relocation assistance
Health benefits

