OpenAI's Fleet team is seeking a Software Engineer to support their computing environment that powers cutting-edge AI research and product development. This role focuses on managing large-scale systems across data centers, GPUs, and networking infrastructure, ensuring high availability and performance. The position is integral to enabling OpenAI's models, including ChatGPT, to operate at scale.
As a Software Engineer in Fleet Management, you'll be responsible for building systems to manage hardware, configurations, and vendor interactions. The role combines deep technical expertise in cloud and bare-metal infrastructure with a focus on automation and efficiency. You'll work in a hybrid environment (3 days in office) in San Francisco, with relocation assistance provided.
The ideal candidate should have extensive experience with cluster-level systems like Kubernetes and cloud providers, along with deep knowledge of server-level systems including Linux kernels and containerization. You'll be part of a team that prioritizes safety, reliability, and responsible AI deployment over unchecked growth.
OpenAI offers a competitive compensation package ranging from $360K to $440K, plus equity and comprehensive benefits including medical insurance, 401(k) matching, and generous parental leave. This is an opportunity to shape the future of AI technology while working with cutting-edge infrastructure at scale.
The role combines technical depth with strategic thinking, requiring someone who can both architect complex systems and collaborate effectively across teams. You'll be at the forefront of AI infrastructure, helping to build and maintain the systems that power some of the most advanced AI models in the world.