Deep Learning Performance Architect

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Machine Learning
Senior Software Engineer
In-Person
5+ years of experience
AI

Description For Deep Learning Performance Architect

NVIDIA, the world leader in accelerated computing, is seeking a Deep Learning Performance Architect to join their AI performance modeling and analysis efforts. This role focuses on developing processor and system architectures that accelerate deep learning and high-performance computing applications. The position offers the opportunity to work on DL performance modeling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads.

The ideal candidate will be responsible for analyzing cutting-edge deep learning networks, developing analytical models, and influencing both software and architecture teams for NVIDIA's current and next-generation inference products. They will work on specifying hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future processor configurations.

This role requires a strong background in computer science, electrical engineering, or related fields, with extensive experience in AI models and deep learning frameworks. The position offers the chance to work with some of the most forward-thinking professionals in the technology world, contributing to NVIDIA's dynamic and innovative environment.

NVIDIA provides competitive salaries and generous benefits, making it one of the technology world's most desirable employers. The company is committed to fostering a diverse work environment and proudly maintains an equal opportunity employment policy. This role represents an excellent opportunity to be at the forefront of AI and deep learning technology advancement while working with state-of-the-art hardware and software systems.

Last updated 12 days ago

Responsibilities For Deep Learning Performance Architect

  • Analyze state of the art DL networks (LLM etc.), identify and prototype performance opportunities
  • Develop analytical models for deep learning networks and algorithms
  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy
  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW

Requirements For Deep Learning Performance Architect

Python
  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience
  • 5+ years work experience
  • Experience with popular AI models (e.g., LLM and AIGC models)
  • Familiar with typical deep learning SW framework (Torch/JAX/TensorFlow/TensorRT)
  • Knowledge and experience on hardware architectures for deep learning applications

Interested in this job?

Jobs Related To NVIDIA Deep Learning Performance Architect

Senior Software Engineer - Conversational AI

Senior Software Engineer position at NVIDIA focusing on building next-generation Conversational AI systems and Digital Human solutions using advanced Speech and LLM models.

Senior Software Engineer, Deep Learning Inference

Senior Software Engineer role at NVIDIA focusing on optimizing deep learning inference performance and implementing AI runtime solutions.

Senior System Software Engineer, Deep Learning Accelerator

Senior System Software Engineer role at NVIDIA focusing on Deep Learning Accelerator development, requiring 7+ years of experience in low-level software development and system architecture.

Deep Learning Engineer, End-to-end - Autonomous Driving

Senior Deep Learning Engineer position at NVIDIA focusing on end-to-end autonomous driving solutions, combining AI expertise with automotive technology.

Senior Software Engineer, TensorRT-LLM

Senior Software Engineer position at NVIDIA focusing on TensorRT-LLM development, requiring expertise in C++, deep learning, and AI inferencing optimization.