Software Engineer, Model Inference

OpenAI

OpenAI is an AI research and deployment company dedicated to ensuring that general-purpose artificial intelligence benefits all of humanity.

San Francisco Bay Area, CA, USA

$200,000 - $370,000

Backend

Senior Software Engineer

In-Person

501 - 1,000 Employees

3+ years of experience

Description For Software Engineer, Model Inference

OpenAI is seeking a Software Engineer for Model Inference to join its Applied AI team. This role focuses on scaling up critical inference infrastructure to efficiently serve customer requests for state-of-the-art AI models like GPT-4 and DALL-E.

Key responsibilities include:

Collaborating with ML researchers, engineers, and product managers to productionize the latest technologies
Improving performance, latency, throughput, and efficiency of deployed models
Building tools for visibility into bottlenecks and addressing high-priority issues
Optimizing code and Azure VMs to maximize hardware utilization

The ideal candidate should have:

Understanding of modern ML architectures and optimization for inference
At least 3 years of professional software engineering experience
Expertise in HPC technologies (InfiniBand, MPI, CUDA)
Experience with production distributed systems
Ability to work on end-to-end problems and learn new skills as needed

OpenAI offers a competitive compensation package, including equity and comprehensive benefits such as medical insurance, 401(k) matching, unlimited time off, and parental leave. The company values diversity and is committed to creating an inclusive environment for all employees.

Join OpenAI in shaping the future of AI technology and ensuring its benefits are widely shared.

Last updated 5 months ago

Responsibilities For Software Engineer, Model Inference

Work with ML researchers, engineers, and product managers to bring the latest technologies into production
Improve performance, latency, throughput, and efficiency of deployed models
Build tools for visibility into bottlenecks and address high-priority issues
Optimize code and Azure VMs to maximize hardware utilization

Requirements For Software Engineer, Model Inference

Python

At least 3 years of professional software engineering experience
Expert in core HPC technologies: InfiniBand, MPI, CUDA
Experience architecting, observing, and debugging production distributed systems
Understanding of modern ML architectures and optimization for inference
Ability to own problems end-to-end and learn new skills as needed

Benefits For Software Engineer, Model Inference

Medical Insurance

Dental Insurance

Vision Insurance

401k

Education Budget

Parental Leave

Mental Health Assistance

Medical, dental, and vision insurance for you and your family
Mental health and wellness support
401(k) plan with 50% matching
Unlimited time off and 13 company holidays per year
Paid parental leave (20 weeks) and family-planning support
Annual learning & development stipend ($1,500 per year)
