Senior System Software Engineer - Dynamo and Triton Inference Server

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
$184,000 - $356,500
Backend
Senior Software Engineer
Hybrid
6+ years of experience
AI · Enterprise SaaS

Description For Senior System Software Engineer - Dynamo and Triton Inference Server

NVIDIA is seeking a Senior System Software Engineer to join their GPU-accelerated deep learning software team, focusing on Dynamo and Triton Inference Server development. This role is at the forefront of AI innovation, working with technology that powers breakthroughs in image classification, speech recognition, and natural language processing. The position involves developing open-source software for AI model inference on GPUs, requiring expertise in distributed systems and machine learning.

The ideal candidate will work in a fast-paced environment, developing high-performance software components for distributed inference workloads. Key responsibilities include optimizing prediction throughput, managing asynchronous requests, and integrating cutting-edge open source technologies. The role requires strong programming skills in Rust/Python/C++ and experience with distributed systems.

NVIDIA offers a competitive compensation package, including a base salary range of $184,000 - $356,500, equity, and comprehensive benefits. The company is known for being one of technology's most desirable employers, offering the opportunity to work with passionate experts in the field of AI and deep learning. This position provides a unique opportunity to contribute to the development of real-time, efficient computing platforms driving success in AI and deep learning.

Last updated 18 hours ago

Responsibilities For Senior System Software Engineer - Dynamo and Triton Inference Server

  • Develop open source software to serve inference of trained AI models running on GPUs
  • Build robust, scalable, high performance software components for distributed inference workloads
  • Work with team leads to prioritize features and capabilities
  • Load-balance asynchronous requests across available resources
  • Optimize prediction throughput under latency constraints
  • Integrate the latest open source technology

Requirements For Senior System Software Engineer - Dynamo and Triton Inference Server

Python
Rust
Kubernetes
  • Masters or PhD or equivalent experience
  • 6+ years in Computer Science, Computer Engineering, or related field
  • Ability to work in a fast-paced, agile team environment
  • Excellent Rust/Python / C++ programming and software design skills
  • Experience with high scale distributed systems and ML systems

Benefits For Senior System Software Engineer - Dynamo and Triton Inference Server

Equity
  • Equity
  • Benefits package offered

Interested in this job?

Jobs Related To NVIDIA Senior System Software Engineer - Dynamo and Triton Inference Server

Senior ASIC Front End Infrastructure Engineer

Senior ASIC Infrastructure role at NVIDIA focusing on RTL/Verification infrastructure development for GPUs, requiring 8+ years of experience and strong technical expertise.

Senior Software Engineer - Data Center System Bringup

Senior Software Engineer role at NVIDIA focusing on data center system bringup and debugging for GPU accelerated applications.

Senior NVLink System Software Bringup Engineer - Fabric Networking GPU

Senior software engineering role at NVIDIA focusing on NVLink system software development and bringup for datacenter platforms, requiring 5+ years of experience.

Senior HPC Performance Engineer

Senior HPC Performance Engineer role at NVIDIA focusing on GPU communications libraries and networking for deep learning and high-performance computing applications.

Senior ASIC Timing Engineer

Senior ASIC Timing Engineer role at NVIDIA focusing on timing analysis and closure for GPUs, CPUs, and DPUs, offering competitive compensation and opportunity to work on cutting-edge technology.