Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

World leader in accelerated computing, pioneering AI and digital twins technology.
$144,000 - $270,250
Cloud
Senior Software Engineer
Remote
5+ years of experience
AI · Enterprise SaaS

Description For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

NVIDIA is seeking experienced Software Engineers for its DGX Cloud team to support AI training and inference development. The role focuses on building and running private and public clouds at production scale, and on developing platforms, tools, and services for bare-metal accelerated compute infrastructure. The position requires expertise in cloud infrastructure, automation, and distributed systems.

The ideal candidate will have strong programming skills in Python or Go, extensive experience with infrastructure automation, and deep knowledge of cloud technologies. They'll be responsible for designing and maintaining cloud services, implementing automation, and ensuring system reliability. The role involves working with cutting-edge AI and accelerated computing technologies.

NVIDIA, a leader in AI, High-Performance Computing, and Visualization, offers an opportunity to work with state-of-the-art technology. The company is known for inventing the GPU and is at the forefront of artificial intelligence and autonomous vehicle development. It offers competitive compensation, including equity, and values diversity in its workforce.

This role is perfect for someone who is passionate about large-scale infrastructure, automation, and distributed systems, with a desire to work on technology that powers the future of AI and computing. The position offers flexibility with remote work options and the chance to work with some of the most innovative technologies in the industry.


Responsibilities For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

  • Design, build, and run cloud infrastructure services
  • Define internal-facing service level objectives and error budgets
  • Eliminate toil and implement automation
  • Practice sustainable, blameless incident prevention and response
  • Consult with peer teams on systems design best practices
  • Participate in on-call rotation

Requirements For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

Python · Go · Kubernetes · Linux

  • BS degree in Computer Science or related technical field
  • 5+ years of experience in infrastructure and fleet management engineering
  • Proficiency in Python or Go
  • Experience with infrastructure automation and distributed systems design
  • Track record of project initiation and collaboration
  • In-depth knowledge of Linux, Slurm, Kubernetes, Storage, and Systems Networking

Benefits For Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

  • Equity


Jobs Related To NVIDIA Senior DGX Cloud Software Engineer - Infrastructure Automation and Distributed Systems

Senior Software Engineer, Bare Metal Automation - DGX Cloud

Senior Software Engineer position at NVIDIA focusing on bare metal automation for DGX Cloud, managing large-scale GPU clusters and implementing monitoring systems.

Senior DGX Cloud Software Engineer - AI NeoCloud Infrastructure Automation

Senior cloud engineering role at NVIDIA focusing on AI infrastructure automation and distributed systems, offering competitive compensation and remote work options.

Senior System Software Engineer - Scientific Computing PaaS

Senior System Software Engineer role at NVIDIA focusing on building a scientific computing platform on DGX Cloud.

Senior Software Engineer - HPC

Senior Software Engineer position at NVIDIA focusing on HPC infrastructure development and management using cloud technologies.

Senior Software Engineer - Cloud Efficiency

Senior Cloud Engineer role at NVIDIA focusing on cloud infrastructure optimization, requiring 12+ years of experience and offering compensation between $200K and $391K.