Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
$148,000 - $276,000
Cloud
Senior Software Engineer
Remote
5,000+ Employees
5+ years of experience
AI · Enterprise SaaS

Description For Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

NVIDIA, the world leader in accelerated computing, is seeking a Senior DGX Cloud Software Engineer to join their Infrastructure Automation and Distributed Systems team. This role focuses on building and running private and public clouds at production scale, ensuring reliable operation of internal and external facing cloud services infrastructure for accelerated computing.

The position offers an opportunity to work with cutting-edge technology in AI and digital twins, while being part of a team that transforms major industries. You'll be responsible for designing and implementing cloud infrastructure services, defining service level objectives, and driving automation initiatives. The role combines technical expertise in cloud technologies with strategic thinking about infrastructure scalability and reliability.

The ideal candidate brings 5+ years of experience in cloud infrastructure and distributed systems, with strong skills in Python, Go, or C++, and deep knowledge of technologies like Linux, Kubernetes, and containers. You'll work in an environment that values both technical excellence and collaboration, with opportunities to influence system design across the organization.

With a competitive base salary range of $148,000-$276,000, plus equity and comprehensive benefits, NVIDIA offers an attractive compensation package. The company's commitment to innovation, coupled with its position as one of technology's most desirable employers, makes this an exceptional opportunity for a senior engineer looking to make an impact in cloud infrastructure and distributed systems.

Last updated a day ago

Responsibilities For Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

  • Design, build, and run cloud infrastructure services
  • Define internal facing service level objectives and error budgets
  • Eliminate toil and implement automation
  • Practice sustainable blameless incident prevention and response
  • Consult with peer teams on systems design best practices

Requirements For Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

Python
Go
Linux
Kubernetes
  • BS degree in Computer Science or related technical field
  • 5+ years of relevant experience
  • Experience with infrastructure automation and distributed systems design
  • Experience in Python, Go or C++
  • In-depth knowledge of Linux, Slurm, Kubernetes, Networking, Storage, and Containers
  • Strong communication skills and systematic problem-solving approach

Benefits For Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

Equity
  • Equity
  • Comprehensive benefits package

Interested in this job?

Jobs Related To NVIDIA Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

Senior System Software Engineer - Scientific Computing PaaS

Senior System Software Engineer position at NVIDIA focusing on building scientific computing platform on DGX Cloud, requiring expertise in cloud computing and distributed systems.

Senior Software Engineer, Kubernetes - DGX Cloud

Senior Software Engineer position at NVIDIA focusing on Kubernetes and GPU infrastructure for DGX Cloud, offering competitive salary and opportunity to work with cutting-edge AI technology.

Senior Software Engineer, Reliability and Operational Excellence - DGX Cloud

Senior Software Engineer position at NVIDIA focusing on reliability and operational excellence for DGX Cloud services.

Senior Software Engineer, Bare Metal Automation - DGX Cloud

Senior Software Engineer position at NVIDIA focusing on bare metal automation for DGX Cloud, managing large-scale GPU clusters for AI workloads.

Senior Software Engineer - HPC

Senior Software Engineer position at NVIDIA focusing on HPC infrastructure, requiring 10+ years of experience in designing and implementing large-scale distributed systems.