OpenAI is seeking a Principal Engineer for its GPU Platform team within the Applied AI Infrastructure division in San Francisco. The role is central to running the infrastructure that powers ChatGPT and the API, focusing on inference Kubernetes clusters, GPU health, InfiniBand performance, and node lifecycle management. The position offers a competitive salary range of $405K-$590K plus equity, with a hybrid work model requiring three days in the office.
The ideal candidate will have 10+ years of experience building core infrastructure, particularly GPU clusters and Kubernetes at scale. They will be responsible for designing and scaling inference infrastructure, ensuring system reliability, and participating in on-call rotations for critical incidents. The role demands someone who can thrive in ambiguous situations and adapt to rapid change while maintaining high standards for system scalability and security.
OpenAI emphasizes safety over unfettered growth and is committed to ensuring AI benefits humanity. The company offers a comprehensive benefits package including relocation assistance, health benefits, and equity. It maintains a strong commitment to diversity, equity, and inclusion, seeking candidates who can contribute to a culture that challenges groupthink while making everyone feel welcome.
This is an exceptional opportunity for a senior technical leader to shape the future of AI infrastructure at one of the world's leading AI companies. The role combines technical excellence with the chance to work on cutting-edge AI systems that have real-world impact. The position requires being based in San Francisco, with the company providing relocation support for new employees.