Data Center System Software Architect, DGX Cloud

World leader in accelerated computing, pioneering AI and digital twins technology.
$180,000 - $339,250
Cloud
Principal Software Engineer
Remote
10+ years of experience
AI · Enterprise SaaS

Description For Data Center System Software Architect, DGX Cloud

NVIDIA is seeking a Data Center System Software Architect for their DGX Cloud team to lead the architecture, design, and implementation of next-generation DGX cloud clusters. This role combines deep technical expertise in system software with cloud infrastructure innovation. The position requires leading hybrid cloud deployments, orchestrating workloads, and optimizing application performance across NVIDIA's AI infrastructure.

The ideal candidate will bring 10+ years of experience and advanced education in Computer Science or related fields, with expertise spanning Linux systems, Python programming, and high-level languages. They'll work at the intersection of hardware and software, leading technical activities for data centers while focusing on hybrid deployments between cloud and on-premises environments.

This is an opportunity to shape the future of AI infrastructure at NVIDIA, the world leader in accelerated computing. The role offers competitive compensation between $180,000-$339,250 plus equity, and allows for remote work options. You'll be part of a team advancing NVIDIA's capacity to build and deploy leading infrastructure solutions for AI-based applications that impact core data science.

The position requires both technical depth and leadership skills, as you'll be translating requirements into vision, architecture, and roadmap while working across multiple engineering teams. Experience with GPU deep learning, container orchestration, and NVIDIA's AI software stack would be particularly valuable. Join NVIDIA in pushing the boundaries of technological advancement in AI and cloud computing.

Last updated 18 hours ago

Responsibilities For Data Center System Software Architect, DGX Cloud

  • Lead technical activities for data centers with focus on hybrid deployments between cloud and on-prem
  • Provide expertise in infrastructure workflows, including hardware, workload orchestration and application tuning
  • Provide fast and creative solutions for complex problems and write effective architecture specification
  • Translate requirements to vision, architecture and roadmap
  • Work with engineering teams across NVIDIA to ensure software integrates seamlessly from hardware to AI training applications

Requirements For Data Center System Software Architect, DGX Cloud

Python
Go
Rust
Linux
Kubernetes
  • Masters or PhD in Computer Science, Computer Engineering, Physics or equivalent experience
  • 10+ years of experience in this field
  • Data Sciences, Deep Learning, or Machine Learning coursework
  • Ability to work with Linux system environments and Python programming
  • Programming skills in high-level languages (C, C++, Go, Rust etc)
  • System-level experience with both hardware and software
  • Strong problem-solving and customer-facing communication skills
  • Strong design, coding, analytical, debugging skills
  • Passion for continuous learning and ability to work with multiple groups

Benefits For Data Center System Software Architect, DGX Cloud

Equity
  • Equity

Interested in this job?

Jobs Related To NVIDIA Data Center System Software Architect, DGX Cloud

Principal Systems Software Engineer - Cloud Infrastructure and Development

Lead cloud infrastructure development at NVIDIA using OpenStack and Kubernetes, shaping the future of AI and digital twins.

Principal Architect Cloud Infrastructure

NVIDIA seeks Principal Architect for scalable hybrid cloud infrastructure, offering competitive salary and benefits.

HPC Operations Manager – Hardware Engineering

NVIDIA seeks an HPC Operations Manager to lead global HPC clusters for hardware design teams, focusing on reliability, technology evaluation, and team leadership.

Principal Software Engineering Manager

Principal Software Engineering Manager position at Microsoft Security, leading cloud security platform development in Bangalore, requiring 12+ years of experience in software engineering and cloud technologies.

Senior Director, Technical Program Management: Cloud Economics and Capacity Management

Senior Director TPM role leading Cloud Economics and Capacity Management teams at Salesforce, overseeing data center operations and infrastructure programs.