Software Engineer, Networking

AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$380,000 - $555,000
Distributed Systems
Senior Software Engineer
Hybrid
1,000 - 5,000 Employees
AI

Description For Software Engineer, Networking

OpenAI is seeking a Software Engineer specializing in Networking to join their Platform Networking team. This role is crucial in developing the communication infrastructure that powers OpenAI's largest AI training operations. The position involves working with cutting-edge technology in a hybrid work environment based in San Francisco.

The Platform Networking team is responsible for the collective communication stack used in OpenAI's largest training jobs. Using C++ and CUDA, the team develops novel collective communication techniques that enable efficient training of flagship models on custom-built supercomputers. This work directly impacts AI research progress at OpenAI and the field as a whole.

As a Software Engineer in Networking, you'll be designing and implementing custom networking collectives tightly integrated into the training stack. The role requires expertise in low-level performance-critical software development, with collective communication experience being a valuable addition. You'll work closely with ML researchers, optimize network transports for large-scale training jobs, and contribute to future supercomputer network designs.

The position offers competitive compensation ranging from $380K to $555K and includes benefits such as relocation assistance. OpenAI provides a hybrid work model requiring 3 days in the office per week. The company is committed to diversity, equality, and ensuring AI benefits all of humanity.

This role is perfect for someone who thrives on technical challenges, has experience with RDMA distributed algorithms, and is comfortable with low-level performance-sensitive code. You'll be at the forefront of AI infrastructure development, working on systems that enable the training of some of the most advanced AI models in the world.

Last updated 4 days ago

Responsibilities For Software Engineer, Networking

  • Collaborate closely with ML researchers to design and implement efficient collective operations in C++ and CUDA
  • Ensure that largest training jobs take full advantage of the different network transports used in supercomputers
  • Work on simulations to inform future supercomputer network designs

Requirements For Software Engineer, Networking

  • Experience with RDMA distributed algorithms
  • Comfortable writing low level performance sensitive CPU and/or GPU code
  • Familiar with network simulation techniques

Benefits For Software Engineer, Networking

Relocation Benefits
  • Relocation assistance
  • Hybrid work model (3 days in office)

Interested in this job?

Jobs Related To OpenAI Software Engineer, Networking

Senior Software Engineer, Systems Infrastructure

Senior Software Engineer position at LinkedIn focusing on building and maintaining large-scale distributed systems and infrastructure platforms that power LinkedIn's core services.

Pod Software Engineer

Senior Pod Software Engineer position at Etched, focusing on high-performance networking solutions for AI inference workloads in San Jose.

Senior Software Engineer-Distributed Inference

Senior Software Engineer position at NVIDIA focusing on distributed inference and AI performance optimization, offering competitive compensation and remote work options.

Senior Software Engineer, Systems Infrastructure

Senior Software Engineer role at LinkedIn focusing on building and maintaining large-scale distributed systems and infrastructure platforms that power LinkedIn's core services.

Senior Software Engineer, Infrastructure, Google Cloud Networking

Senior Software Engineer position at Google Cloud focusing on infrastructure and networking systems, requiring expertise in C++ and distributed systems development.