Senior Distributed Systems Engineer, AI Infrastructure

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
Distributed Systems
Senior Software Engineer
In-Person
5+ years of experience
AI · Automotive

Description For Senior Distributed Systems Engineer, AI Infrastructure

NVIDIA is seeking a Senior Distributed Systems Engineer to lead the development of its exascale AI infrastructure for Autonomous Vehicles. This role combines cutting-edge distributed systems work with AI applications, focusing on building the foundation for autonomous driving technology. The position requires expertise in cloud technologies, distributed storage and compute systems, and strong technical leadership skills.

The role involves architecting and developing scalable services that power AI infrastructure for deep learning platforms, handling petabyte-scale datasets, and designing next-generation dataset management services. You'll work at the intersection of distributed systems and AI, enabling smart data selection capabilities crucial for machine learning success.

As a technical leader, you'll collaborate with multiple AI teams, contributing to the platform's architecture while ensuring it meets current and future requirements. The position offers the opportunity to work on one of technology's most ambitious challenges, autonomous vehicles, with potential applications in medical imaging, data science, and genomics.

NVIDIA offers highly competitive compensation and is renowned as one of the technology industry's most desirable employers. You'll join forward-thinking teams working in state-of-the-art fields including Deep Learning, Artificial Intelligence, and Autonomous Vehicles. The role provides an excellent opportunity to impact critical projects while working with cutting-edge technology in a collaborative environment.

The ideal candidate combines strong programming skills with distributed systems expertise, security knowledge, and technical leadership experience. This position offers the chance to shape the future of AI infrastructure while working with some of the industry's most advanced technologies and talented professionals.

Last updated a month ago

Responsibilities For Senior Distributed Systems Engineer, AI Infrastructure

  • Architect and build scalable and distributed services for AI infrastructure
  • Design and build infrastructure for petabyte-scale deep learning datasets
  • Design next-generation dataset management services
  • Enable smart data selection for machine learning
  • Collaborate with AI teams to understand requirements
  • Be a technical leader on platform projects
  • Support platform users

Requirements For Senior Distributed Systems Engineer, AI Infrastructure

Go · Java · Python · Scala · Kubernetes
  • BS, MS, or PhD in Computer Architecture, Computer Science, Electrical Engineering or related field
  • 5+ years of experience in distributed systems development and design
  • Strong programming background in data structures, design patterns, OOP, and TDD
  • Experience with distributed computing and storage systems
  • Knowledge of authentication and authorization technologies
  • Advanced programming skills in distributed systems and microservices
  • Expert-level programming skills in Go, Java, or C/C++
  • Strong interpersonal skills and ability to work with cross-functional teams
  • Track record of successful technical leadership
