Senior Software Engineer, Deep Learning Inference - Windows

NVIDIA is the world leader in accelerated computing, pioneering solutions to tackle challenges no one else can solve.
$148,000 - $276,000
Machine Learning
Senior Software Engineer
Hybrid
5+ years of experience
AI

Description For Senior Software Engineer, Deep Learning Inference - Windows

At NVIDIA, we're at the forefront of innovation, driving advancements in AI and machine learning to solve some of the world's most challenging problems. We're seeking talented and motivated engineers to join our TensorRT team in developing the industry-leading deep learning inference software for NVIDIA AI accelerators.

As a Senior Software Engineer in the TensorRT team, you will be responsible for designing and implementing inference optimizations to enable real-time AI applications on personal computing devices with NVIDIA AI accelerators (GPU, DLA). You will work closely with cross-functional teams to integrate and deploy AI solutions in production environments. Your expertise will help shape the performance, functionality, and efficiency of our AI models and systems.

Key responsibilities include:

  • Designing, implementing, and optimizing TensorRT components for Generative AI inference applications
  • Developing software in C++, Python, CUDA, and DirectML to accelerate AI model deployment
  • Collaborating with deep learning experts and GPU architects throughout the company

We're looking for candidates with:

  • A strong background in Computer Science or related field (BS, MS, or PhD)
  • 5+ years of software development experience on large codebases
  • Expertise in C++ and Python programming
  • Experience with Deep Learning Frameworks, Compilers, or System Software
  • Knowledge of Machine Learning techniques and GPU optimizations

Additional skills that will set you apart:

  • Experience in developing DirectML backend for GPU or NPU
  • Windows application and middleware development using DirectX or DirectML API
  • Knowledge of GPU programming using CUDA or OpenCL
  • Experience deploying AI models in production environments

Join NVIDIA and be part of a team that's shaping the future of AI and accelerated computing. You'll work on challenging projects, collaborate with industry experts, and have the opportunity to make a significant impact in a company that values creativity, excellence, and innovation.

Last updated 2 months ago

Responsibilities For Senior Software Engineer, Deep Learning Inference - Windows

  • Design, implement and optimize TensorRT components for Generative AI inference applications
  • Develop software in C++, Python, CUDA, and DirectML to accelerate AI model deployment
  • Collaborate with deep learning experts and GPU architects

Requirements For Senior Software Engineer, Deep Learning Inference - Windows

Python
  • BS, MS, PhD or equivalent experience in Computer Science, Computer Engineering or related field
  • 5+ years of software development experience on a large codebase or project
  • Strong proficiency in C++ and Python programming languages
  • Experience with development of: Deep Learning Frameworks, Compilers, or System Software
  • Foundational knowledge of Machine Learning techniques or GPU optimizations
  • Excellent problem-solving skills
  • Strong communication skills

Benefits For Senior Software Engineer, Deep Learning Inference - Windows

Equity
  • Equity

Interested in this job?

Jobs Related To NVIDIA Senior Software Engineer, Deep Learning Inference - Windows

Senior Software Engineer - Conversational AI

Senior Software Engineer position at NVIDIA focusing on building next-generation Conversational AI systems and Digital Human solutions using advanced Speech and LLM models.

Senior Software Engineer, Deep Learning Inference

Senior Software Engineer role at NVIDIA focusing on optimizing deep learning inference performance and implementing AI runtime solutions.

Senior System Software Engineer, Deep Learning Accelerator

Senior System Software Engineer role at NVIDIA focusing on Deep Learning Accelerator development, requiring 7+ years of experience in low-level software development and system architecture.

Deep Learning Engineer, End-to-end - Autonomous Driving

Senior Deep Learning Engineer position at NVIDIA focusing on end-to-end autonomous driving solutions, combining AI expertise with automotive technology.

Senior Software Engineer, TensorRT-LLM

Senior Software Engineer position at NVIDIA focusing on TensorRT-LLM development, requiring expertise in C++, deep learning, and AI inferencing optimization.