Software Engineer, Model Inference

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.
$200,000 - $370,000
Backend
Senior Software Engineer
In-Person
501 - 1,000 Employees
3+ years of experience

Description For Software Engineer, Model Inference

OpenAI is seeking a Software Engineer for Model Inference to join their Applied AI team. This role focuses on scaling up critical inference infrastructure to efficiently service customer requests for state-of-the-art AI models like GPT-4 and DALL-E.

Key responsibilities include:

  • Collaborating with ML researchers, engineers, and product managers to productionize latest technologies
  • Improving performance, latency, throughput, and efficiency of deployed models
  • Building tools for visibility into bottlenecks and addressing high-priority issues
  • Optimizing code and Azure VMs to maximize hardware utilization

The ideal candidate should have:

  • Understanding of modern ML architectures and optimization for inference
  • At least 3 years of professional software engineering experience
  • Expertise in HPC technologies (InfiniBand, MPI, CUDA)
  • Experience with production distributed systems
  • Ability to work on end-to-end problems and learn new skills as needed

OpenAI offers a competitive compensation package, including equity and comprehensive benefits such as medical insurance, 401(k) matching, unlimited time off, and parental leave. The company values diversity and is committed to creating an inclusive environment for all employees.

Join OpenAI in shaping the future of AI technology and ensuring its benefits are widely shared.

Last updated 4 months ago

Responsibilities For Software Engineer, Model Inference

  • Work with ML researchers, engineers, and product managers to bring latest technologies into production
  • Improve performance, latency, throughput, and efficiency of deployed models
  • Build tools for visibility into bottlenecks and address high-priority issues
  • Optimize code and Azure VMs to maximize hardware utilization

Requirements For Software Engineer, Model Inference

Python
  • At least 3 years of professional software engineering experience
  • Expert in core HPC technologies: InfiniBand, MPI, CUDA
  • Experience architecting, observing, and debugging production distributed systems
  • Understanding of modern ML architectures and optimization for inference
  • Ability to own problems end-to-end and learn new skills as needed

Benefits For Software Engineer, Model Inference

Medical Insurance
Dental Insurance
Vision Insurance
401k
Education Budget
Parental Leave
Mental Health Assistance
  • Medical, dental, and vision insurance for you and your family
  • Mental health and wellness support
  • 401(k) plan with 50% matching
  • Unlimited time off and 13 company holidays per year
  • Paid parental leave (20 weeks) and family-planning support
  • Annual learning & development stipend ($1,500 per year)

Interested in this job?

Jobs Related To OpenAI Software Engineer, Model Inference

Sr ECAD Application Engineer, Project Kuiper Satellites

Senior ECAD Tools Application Engineer position at Amazon's Project Kuiper, focusing on satellite constellation development and ECAD tool management.

System Development Engineer, Private Pricing Product Management (3PM)

Senior Systems Development Engineer role at AWS focusing on Private Pricing Product Management, building scalable solutions and tools using modern technologies.

Senior Product Manager - Tech

Lead Amazon's Buy Now checkout experience as Senior Product Manager, driving innovation in e-commerce with competitive compensation and comprehensive benefits.

Senior Software Development Engineer, AWS Alameda

Senior Software Engineer role at AWS Alameda, focusing on control plane development and distributed systems with 5+ years of experience required.

Software Dev Engineer (L5), Global Talent Management & Compensation

Senior Software Engineer role at Amazon's Edinburgh office, building scalable talent management solutions using AWS technologies.