Software Engineer in Systems

AI research and deployment company dedicated to ensuring general-purpose artificial intelligence benefits humanity
$360,000 - $530,000
Distributed Systems
Senior Software Engineer
Hybrid
1,000 - 5,000 Employees
5+ years of experience
AI

Description For Software Engineer in Systems

OpenAI's Platform Systems team is at the forefront of AI and distributed systems engineering, focusing on training flagship models on custom-built supercomputers. The team develops proprietary model training software, specializing in collective communication, scheduling, compute efficiency, parallelism strategies, fault tolerance, and observability. As a Software Engineer in Systems, you'll be instrumental in designing and building distributed systems for next-generation model training. The role offers a hybrid work environment in San Francisco, with comprehensive benefits including competitive salary, equity, and extensive healthcare coverage. The position requires expertise in high-performance computing and distributed systems, with opportunities to work on cutting-edge AI infrastructure. OpenAI's mission focuses on ensuring AI benefits humanity, making this role ideal for those passionate about impactful, large-scale distributed systems engineering.

Last updated an hour ago

Responsibilities For Software Engineer in Systems

  • Build systems to distribute work across massive GPU clusters efficiently
  • Design and implement methods to make training stack more efficient & scale up to next generation super computers
  • Design and implement methods to robustly train models in the presence of hardware failures
  • Build tooling to help understand problems in largest training jobs

Requirements For Software Engineer in Systems

  • Love getting into low level details about performance
  • Passionate about building stable and highly efficient distributed systems
  • Background in high performance computing and/or low level systems

Benefits For Software Engineer in Systems

Medical Insurance
Dental Insurance
Vision Insurance
Mental Health Assistance
401k
Parental Leave
Education Budget
Relocation Benefits
Equity
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Generous time off and company holidays
  • 24 weeks paid birth-parent leave & 20-week paid parental leave
  • Annual learning & development stipend ($1,500 per year)
  • Relocation assistance
  • Generous equity

Interested in this job?

Jobs Related To OpenAI Software Engineer in Systems

Software Engineer, Compute - Storage

Senior Software Engineer position at OpenAI focusing on storage infrastructure and exascale data management systems, offering competitive compensation and comprehensive benefits.

Distributed Systems Engineer, Security

Senior Distributed Systems Engineer role at OpenAI focusing on security infrastructure and system optimization for large-scale AI computing environments.

Software Engineer, Networking

Design and implement custom networking collectives for OpenAI's largest training jobs using C++ and CUDA.

Software Engineer, Distributed Systems

OpenAI is hiring a Senior Software Engineer for Distributed Systems to build large-scale data systems for AI research in San Francisco.

Software Engineer, Compute - Storage

Senior Software Engineer position at OpenAI focusing on storage infrastructure and exascale data management systems, offering competitive compensation and comprehensive benefits.