NVIDIA is seeking exceptional Senior AI Training Performance Engineers to optimize AI training, one of the most critical workloads in modern computing. The role calls for deep technical expertise in GPU architecture, deep learning frameworks, and performance optimization to achieve maximum efficiency in AI training systems. The position offers a unique opportunity to influence both hardware and software development at NVIDIA, a leading force in the AI revolution.
The role involves working across the entire hardware/software stack, from GPU architecture to deep learning frameworks, to achieve peak performance. You'll be responsible for analyzing and optimizing state-of-the-art neural networks, implementing production-quality software in NVIDIA's deep learning platform stack, and working with proprietary processor and system simulators for future architecture studies.
As part of NVIDIA's Deep Learning Architecture team, you'll be at the forefront of AI computing, working with cutting-edge technology and contributing to products that are transforming industries worldwide. The company is widely recognized as one of the technology world's most desirable employers, known for its innovative work in AI and accelerated computing.
The ideal candidate has a strong academic background (PhD, or MS with equivalent experience) in Computer Science or a related field, combined with extensive experience in deep learning, computer architecture, and performance optimization. The role requires proficiency in C++, Python, and CUDA, along with a deep understanding of GPU architecture and neural network training.
This position offers the opportunity to work on industry-leading deep learning products while helping users worldwide train their models faster. You'll join a forward-thinking team at a company driving the AI revolution, making this an excellent opportunity for those passionate about performance optimization and artificial intelligence.