Software Engineer, Model Inference

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$200,000 - $370,000
Backend
Senior Software Engineer
In-Person
501 - 1,000 Employees
3+ years of experience

Description For Software Engineer, Model Inference

OpenAI is seeking a Software Engineer for Model Inference to join their Applied AI team. This role focuses on scaling up critical inference infrastructure to efficiently service customer requests for state-of-the-art AI models like GPT-4 and DALL-E.

Key responsibilities include:

  • Collaborating with ML researchers, engineers, and product managers to productionize latest technologies
  • Improving performance, latency, throughput, and efficiency of deployed models
  • Building tools for visibility into bottlenecks and addressing high-priority issues
  • Optimizing code and Azure VMs to maximize hardware utilization

The ideal candidate should have:

  • Understanding of modern ML architectures and optimization for inference
  • At least 3 years of professional software engineering experience
  • Expertise in HPC technologies (InfiniBand, MPI, CUDA)
  • Experience with production distributed systems
  • Ability to work on end-to-end problems and learn new skills as needed

OpenAI offers a competitive compensation package, including equity and comprehensive benefits such as medical insurance, 401(k) matching, unlimited time off, and parental leave. The company values diversity and is committed to creating an inclusive environment for all employees.

Join OpenAI in shaping the future of AI technology and ensuring its benefits are widely shared.

Last updated 5 months ago

Responsibilities For Software Engineer, Model Inference

  • Work with ML researchers, engineers, and product managers to bring latest technologies into production
  • Improve performance, latency, throughput, and efficiency of deployed models
  • Build tools for visibility into bottlenecks and address high-priority issues
  • Optimize code and Azure VMs to maximize hardware utilization

Requirements For Software Engineer, Model Inference

Python
  • At least 3 years of professional software engineering experience
  • Expert in core HPC technologies: InfiniBand, MPI, CUDA
  • Experience architecting, observing, and debugging production distributed systems
  • Understanding of modern ML architectures and optimization for inference
  • Ability to own problems end-to-end and learn new skills as needed

Benefits For Software Engineer, Model Inference

Medical Insurance
Dental Insurance
Vision Insurance
401k
Education Budget
Parental Leave
Mental Health Assistance
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (20 weeks) and family-planning support
  • Annual learning & development stipend ($1,500 per year)

Interested in this job?

Jobs Related To OpenAI Software Engineer, Model Inference

Research Infrastructure Engineer - Post-Training

Senior Research Infrastructure Engineer position at OpenAI, focusing on post-training systems for ChatGPT, offering competitive salary and benefits in San Francisco.

Software Engineer - API Designer

Senior Software Engineer role at OpenAI focusing on API design and development, offering $310K-$385K plus equity and comprehensive benefits in San Francisco.

Backend Software Engineer, Intelligent Support Engineering

Senior Backend Software Engineer role at OpenAI, building intelligent support systems with latest AI technologies, $310K-$385K + equity, San Francisco based.

Developer Advocate, Developer Experience - Japan

Senior Developer Advocate position at OpenAI in Tokyo, focusing on developer experience, community building, and technical content creation for AI platform adoption.

Supercomputing, Software Engineer - Scheduling

OpenAI seeks a Supercomputing Software Engineer for Scheduling to design and operate job management systems for large-scale AI model training.