Principal Engineer, GPU Platform

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$385,000 - $550,000
Backend
Principal Software Engineer
Hybrid
10+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Lead Architect/Developer Principle Engineer

Senior Lead Architect role at Wells Fargo focusing on modernizing trading systems and leading technical initiatives in Capital Markets technology.

Director of Software Engineering, Quality

Lead quality engineering teams at Salesforce, implementing quality strategies and driving continuous improvement in software development.

Platform Reliability, Availability, Serviceability (RAS), and Manageability Software Architect

Senior software architect role at Qualcomm focusing on platform reliability, availability, and serviceability for data center solutions.

Sr. CPU Performance Modeling Architect

Senior CPU Performance Modeling Architect role at Qualcomm focusing on high-performance, energy-efficient CPU design and modeling.

Principal Staff Software Engineer - Validation & Resilience Engineering

Principal Staff Software Engineer role at LinkedIn focusing on validation and resilience engineering for large-scale distributed systems.

Description For Principal Engineer, GPU Platform

OpenAI is seeking a Principal Engineer for their GPU Platform team within the Applied Engineering department. This role is crucial in building and maintaining the infrastructure that supports models like ChatGPT and the API. The ideal candidate will have extensive experience in core infrastructure, GPU clusters, and orchestration systems like Kubernetes.

The role involves designing and building inference infrastructure to ensure reliability and performance, scaling systems to the next order of magnitude, and contributing to a diverse and inclusive culture. The position is based in San Francisco, with a hybrid work model of 3 days in the office per week.

Key responsibilities include:

  • Developing scalable and reliable inference infrastructure
  • Ensuring system scalability for future growth
  • Participating in on-call rotations for critical incident response
  • Contributing to a positive and inclusive work environment

The ideal candidate will have:

  • 10+ years of experience in core infrastructure
  • Expertise in running GPU clusters at scale
  • Proficiency with orchestration systems like Kubernetes
  • Ability to thrive in ambiguous and rapidly changing environments
  • Passion for building secure and scalable systems

OpenAI offers a competitive compensation package, including a salary range of $385K – $550K, equity, and comprehensive benefits such as medical, dental, and vision insurance, mental health support, 401(k) matching, unlimited time off, and paid parental leave.

Join OpenAI in their mission to ensure that general-purpose AI benefits all of humanity. They are committed to safety, responsible AI development, and creating a diverse and inclusive work environment. If you're passionate about pushing the boundaries of AI capabilities and want to make a significant impact in the field, this role offers an exciting opportunity to shape the future of technology.

Last updated 7 months ago

Responsibilities For Principal Engineer, GPU Platform

  • Design and build the inference infrastructure that power our products, enabling reliability and performance
  • Ensure our infrastructure can scale to the next order of magnitude
  • Help create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think
  • Participate in on-call rotation to respond to critical incidents as needed

Requirements For Principal Engineer, GPU Platform

Kubernetes
Linux
  • 10+ years building core infrastructure
  • Experience running GPU clusters at scale
  • Experience operating orchestration systems such as Kubernetes at scale
  • Comfortable with ambiguity and rapid change
  • Take pride in building and operating scalable, reliable, secure systems

Benefits For Principal Engineer, GPU Platform

Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
401k
Parental Leave
Education Budget
Equity
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (24 weeks) and family-planning support
  • Annual learning & development stipend ($1,500 per year)
  • Relocation assistance

Interested in this job?