Deep Learning Performance Architect

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Description For Deep Learning Performance Architect

NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Performance Architect to join its team. This role focuses on developing GPU-accelerated deep learning software and optimizing deep learning kernels for inference. The position offers an opportunity to work with cutting-edge technology and collaborate with researchers worldwide who use NVIDIA GPUs to power breakthroughs in numerous areas.

The role involves collaborating with teams across the automotive, image understanding, and speech understanding domains to develop innovative solutions. You'll be responsible for performance optimization, analysis, and tuning of deep learning systems, and you'll have the opportunity to implement the latest algorithms for public release in TensorRT.

As a Deep Learning Performance Architect, you'll be part of a fast-paced, customer-oriented team where excellent communication skills are essential. The position requires strong technical expertise in C/C++ programming, GPU architecture, and deep learning frameworks. You'll have the chance to work with some of the most brilliant minds in the technology industry while contributing to NVIDIA's mission of advancing accelerated computing.

The ideal candidate has at least 5 years of relevant experience, strong software development skills, and a deep understanding of performance optimization. This role offers the opportunity to shape the future of AI and deep learning while working at one of the technology world's most desirable employers.

Last updated 13 days ago

Responsibilities For Deep Learning Performance Architect

  • Develop highly optimized deep learning kernels for inference
  • Perform performance optimization, analysis, and tuning
  • Collaborate with teams across automotive, image understanding, and speech understanding
  • Travel to conferences and customers for technical consultation and training

Requirements For Deep Learning Performance Architect

  • Master's or PhD, or equivalent experience, in a relevant discipline (CE, CS&E, CS, AI)
  • Agile software development skills
  • Excellent C/C++ programming and software design skills
  • Python experience
  • Knowledge of performance modelling, profiling, debugging, and code optimization
  • GPU programming experience (CUDA or OpenCL)
  • 5 years of relevant work experience

Jobs Related To NVIDIA Deep Learning Performance Architect

Developer Technology Engineer - HPC and AI

Senior Developer Technology Engineer position at NVIDIA focusing on HPC and AI, requiring 3+ years experience and advanced degree, based in Seoul.

Senior Software Engineer, LLM Inference

Senior Software Engineer position at NVIDIA focusing on LLM Inference development, requiring expertise in C++, deep learning, and AI technologies.

Senior Perception Engineer

Senior Perception Engineer role at NVIDIA developing autonomous driving solutions using deep learning and computer vision, offering competitive salary and opportunity to work on cutting-edge technology.

Senior On-Device Model Inference Optimization Engineer

Senior AI optimization role at NVIDIA focusing on improving performance and efficiency of AI models for autonomous vehicles, offering competitive salary and equity benefits.

Senior Prediction and Planning Machine Learning Engineer - Autonomous Vehicles

Senior ML Engineer position at NVIDIA focusing on prediction and planning systems for autonomous vehicles.