Supercomputing, Software Engineer - Scheduling

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$360,000 - $440,000
Backend
Senior Software Engineer
Hybrid
AI

Description For Supercomputing, Software Engineer - Scheduling

OpenAI is seeking a Supercomputing, Software Engineer specializing in Scheduling to join its Supercomputing Scheduling Pillar. This team is dedicated to ensuring the reliability, scalability, and user-friendliness of job lifecycle management for some of the largest supercomputers in the world.

As a Software Engineer in this role, you will be responsible for designing, writing, deploying, and operating job lifecycle management systems for model training. The scale of this work is immense, with tight timelines and a fast-moving organization. This is an opportunity to shape a critical system in support of OpenAI's mission to ensure that general-purpose artificial intelligence benefits all of humanity.

Key responsibilities include:

  • Designing, implementing, and operating components of quota management, job scheduling, and queuing systems
  • Interfacing with researchers to understand workload requirements
  • Harmonizing job lifecycle features with cluster infrastructure, storage, and hardware health requirements

The ideal candidate will have:

  • Significant experience with hyperscale scheduling systems
  • Strong programming skills
  • Experience working in public clouds, especially Azure
  • An execution-focused mentality paired with a rigorous focus on user requirements
  • As a bonus, an understanding of AI/ML workloads

OpenAI offers a competitive compensation package, including a salary range of $360K – $440K, generous equity, and comprehensive benefits. These include medical, dental, and vision insurance, mental health support, a 401(k) plan with 50% matching, unlimited time off, paid parental leave, and an annual learning & development stipend.

This role is based in San Francisco, CA, with a hybrid work model of 3 days in the office per week. Relocation assistance is available for new employees. Join OpenAI in shaping the future of technology and ensuring that the benefits of AI are widely shared.

Last updated 6 days ago

Responsibilities For Supercomputing, Software Engineer - Scheduling

  • Design, implement, and operate components of quota management, job scheduling, and queuing systems
  • Interface with researchers to understand workload requirements
  • Harmonize job lifecycle features with cluster infrastructure, storage, and hardware health requirements

Requirements For Supercomputing, Software Engineer - Scheduling

  • Significant experience with hyperscale scheduling systems
  • Strong programming skills
  • Experience working in public clouds (especially Azure)
  • Execution-focused mentality paired with a rigorous focus on user requirements

Benefits For Supercomputing, Software Engineer - Scheduling

  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (24 weeks of paid birth-parent leave and 20 weeks of paid parental leave) and family-planning support
  • Annual learning & development stipend ($1,500 per year)
  • Equity
