Principal Engineer, GPU Platform

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$385,000 - $550,000
Backend
Principal Software Engineer
Hybrid
10+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Principal Product Manager, Search

Principal Product Manager position at LinkedIn focused on leading the Search team to enhance search experience across consumer products using AI and infrastructure expertise.

Principal Staff Software Engineer

Principal Staff Software Engineer role at LinkedIn focusing on building and scaling the technology stack for LinkedIn Marketing Solutions (LMS), the largest B2B Advertising Platform.

Principal Product Manager, Trust Tools

Principal Product Manager position at LinkedIn leading Trust Tools development, requiring 10+ years experience in product management and focus on content moderation and trust/safety features.

Principal Staff Software Engineer - Validation & Resilience Engineering

Principal Staff Software Engineer role at LinkedIn focusing on validation and resilience engineering for large-scale distributed systems.

Principal Software Developer(hybrid)

Principal Software Engineer position at Oracle focusing on distributed systems and cloud infrastructure, offering competitive compensation and comprehensive benefits.

Description For Principal Engineer, GPU Platform

OpenAI is seeking a Principal Engineer for their GPU Platform team within the Applied Engineering department. This role is crucial in building and maintaining the infrastructure that supports models like ChatGPT and the API. The ideal candidate will have extensive experience in core infrastructure, GPU clusters, and orchestration systems like Kubernetes.

The role involves designing and building inference infrastructure to ensure reliability and performance, scaling systems to the next order of magnitude, and contributing to a diverse and inclusive culture. The position is based in San Francisco, with a hybrid work model of 3 days in the office per week.

Key responsibilities include:

  • Developing scalable and reliable inference infrastructure
  • Ensuring system scalability for future growth
  • Participating in on-call rotations for critical incident response
  • Contributing to a positive and inclusive work environment

The ideal candidate will have:

  • 10+ years of experience in core infrastructure
  • Expertise in running GPU clusters at scale
  • Proficiency with orchestration systems like Kubernetes
  • Ability to thrive in ambiguous and rapidly changing environments
  • Passion for building secure and scalable systems

OpenAI offers a competitive compensation package, including a salary range of $385K – $550K, equity, and comprehensive benefits such as medical, dental, and vision insurance, mental health support, 401(k) matching, unlimited time off, and paid parental leave.

Join OpenAI in their mission to ensure that general-purpose AI benefits all of humanity. They are committed to safety, responsible AI development, and creating a diverse and inclusive work environment. If you're passionate about pushing the boundaries of AI capabilities and want to make a significant impact in the field, this role offers an exciting opportunity to shape the future of technology.

Last updated 7 months ago

Responsibilities For Principal Engineer, GPU Platform

  • Design and build the inference infrastructure that power our products, enabling reliability and performance
  • Ensure our infrastructure can scale to the next order of magnitude
  • Help create a diverse, equitable, and inclusive culture that makes all feel welcome while enabling radical candor and the challenging of group think
  • Participate in on-call rotation to respond to critical incidents as needed

Requirements For Principal Engineer, GPU Platform

Kubernetes
Linux
  • 10+ years building core infrastructure
  • Experience running GPU clusters at scale
  • Experience operating orchestration systems such as Kubernetes at scale
  • Comfortable with ambiguity and rapid change
  • Take pride in building and operating scalable, reliable, secure systems

Benefits For Principal Engineer, GPU Platform

Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
401k
Parental Leave
Education Budget
Equity
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (24 weeks) and family-planning support
  • Annual learning & development stipend ($1,500 per year)
  • Relocation assistance

Interested in this job?