Principal Architect Cloud Infrastructure

NVIDIA is the world leader in accelerated computing, pioneering solutions for AI and digital twins that transform industries and society.
$272,000 - $419,750
Cloud
Principal Software Engineer
Hybrid
5,000+ Employees
12+ years of experience
AI · Enterprise SaaS

Description For Principal Architect Cloud Infrastructure

NVIDIA is seeking a Principal Architect to work on a scalable hybrid cloud system for infrastructure services across multiple teams. This role involves crafting scalable cloud solutions to handle millions of jobs and thousands of systems, working with various NVIDIA groups such as Graphics Processors, Mobile Processors, Deep Learning, AI, and Autonomous Vehicles. The cloud services will run on thousands of servers, supporting a heterogeneous mix of machines with various operating systems and hardware platforms.

Key Responsibilities:

  • Design creative, scalable cloud solutions for millions of jobs and thousands of systems
  • Tackle challenging infrastructure problems in areas like NIMs, Kubernetes, job scheduling, and resource management
  • Develop observability solutions to enhance system availability, reliability, and latency
  • Collaborate with customers to understand needs and create innovative solutions

Requirements:

  • Experience in architecting scalable cloud infrastructure solutions
  • Expertise in Kubernetes
  • Strong object-oriented programming skills (Java or Go preferred)
  • Ability to collaborate across multiple teams and time zones
  • Bachelor's degree or equivalent experience
  • Strong software/hardware engineering background
  • 12+ years of experience in infrastructure

Preferred Qualifications:

  • Experience in design, implementation, and deployment of major infrastructure features
  • Knowledge of AI/ML and Data Analytics applied to Infrastructure
  • Experience with large-scale, multi-cluster Kubernetes environments
  • Ability to design robust distributed systems for heterogeneous platforms

NVIDIA offers a competitive base salary range of $272,000 - $419,750 USD, along with equity and comprehensive benefits. Join NVIDIA to work with the most talented people in the world and advance Artificial Intelligence through innovative infrastructure solutions.

Last updated 3 months ago

Responsibilities For Principal Architect Cloud Infrastructure

  • Craft creative scalable cloud solutions to scale to millions of jobs and thousands of systems
  • Tackle challenging problems in areas such as NIMs, Kubernetes, job scheduling, resource management, and automated recovery
  • Build observability solutions to measure and improve system availability, reliability, and latency
  • Work with customers to understand their needs and develop innovative solutions

Requirements For Principal Architect Cloud Infrastructure

Kubernetes
Java
Go
  • Experience in architecting scalable cloud infrastructure solutions
  • Expertise in Kubernetes
  • Strong object-oriented programming background (Java or Go preferred)
  • Ability to collaborate across multiple teams and time zones
  • Bachelor's degree or equivalent experience
  • Strong software/hardware engineering background
  • 12+ years of experience in infrastructure

Benefits For Principal Architect Cloud Infrastructure

Equity
  • Equity
  • Comprehensive benefits package

Interested in this job?

Jobs Related To NVIDIA Principal Architect Cloud Infrastructure

Data Center System Software Architect, DGX Cloud

Lead architect position for NVIDIA's DGX Cloud platform, focusing on next-generation data center systems and AI infrastructure solutions.

Director of Engineering, Cloud and Database Platforms

Lead NVIDIA's cloud and database platforms strategy, managing infrastructure and teams for one of the world's leading AI and computing companies.

Senior Network Architect

Senior Network Architect position at NVIDIA, leading network architecture design and implementation for AI and high-performance computing infrastructure.

Principal Systems Software Engineer - Cloud Infrastructure and Development

Lead cloud infrastructure development at NVIDIA using OpenStack and Kubernetes, shaping the future of AI and digital twins.

HPC Operations Manager – Hardware Engineering

NVIDIA seeks an HPC Operations Manager to lead global HPC clusters for hardware design teams, focusing on reliability, technology evaluation, and team leadership.