Senior AI-HPC Storage Engineer

World leader in accelerated computing, pioneering AI and digital twins technology to transform industries.
$180,000 - $339,250
Cloud
Senior Software Engineer
Hybrid
8+ years of experience
AI · Enterprise SaaS

Description For Senior AI-HPC Storage Engineer

NVIDIA, a global leader in accelerated computing and AI technology, is seeking a Senior AI-HPC Storage Engineer to join their GPU AI/HPC Infrastructure team. This role offers an exciting opportunity to work at the forefront of AI and high-performance computing, designing and implementing cutting-edge storage solutions for demanding deep learning and computational workloads.

The position combines expertise in distributed systems, storage architecture, and cloud computing, requiring both technical depth and strategic thinking. You'll be responsible for developing next-generation storage solutions that power NVIDIA's GPU compute clusters, working with state-of-the-art technology in AI and HPC environments.

As a senior engineer, you'll collaborate with researchers and developers across teams, optimizing performance for AI/HPC workloads and implementing automation solutions for large-scale infrastructure environments. The role offers exposure to cutting-edge technologies including NVIDIA GPUs, deep learning frameworks, and advanced networking solutions.

NVIDIA offers a competitive compensation package with a base salary range of $180,000 to $339,250, plus equity and comprehensive benefits. The company's commitment to innovation, coupled with its significant impact on AI and digital twins technology, makes this an exceptional opportunity for experienced engineers looking to work on challenging problems at scale.

The hybrid work environment and multiple location options (Santa Clara, Westford, or Austin) provide flexibility while working with some of the most talented professionals in the industry. This role is ideal for someone passionate about storage infrastructure, distributed systems, and the application of these technologies in AI and HPC environments.

Last updated 7 days ago

Responsibilities For Senior AI-HPC Storage Engineer

  • Research and implement distributed storage services
  • Design and implement on-prem AI/HPC infrastructure with cloud computing
  • Design scalable next-gen storage solutions for data-intensive applications
  • Develop automation tooling for infrastructure management
  • Document procedures and practices for distributed file systems
  • Collaborate with teams to understand developer workflows
  • Guide methodologies for building and deploying applications
  • Support researchers with performance analysis and optimizations
  • Perform root cause analysis and suggest corrective actions

Requirements For Senior AI-HPC Storage Engineer

Python
Linux
Kubernetes
  • Bachelor's degree in Computer Science, Electrical Engineering or related field
  • 8+ years of experience designing and operating large scale storage infrastructure
  • Experience with parallel or distributed filesystems (Lustre, GPFS)
  • Proficient in Centos/RHEL and/or Ubuntu Linux
  • Python programming and bash scripting
  • Experience with cloud environments (AWS, Azure or GCP)
  • Experience with AI/HPC cluster job schedulers
  • Understanding of container technologies
  • Experience with AI/HPC workflows using MPI

Benefits For Senior AI-HPC Storage Engineer

Equity
  • Competitive base salary
  • Equity
  • Comprehensive benefits package

Interested in this job?

Jobs Related To NVIDIA Senior AI-HPC Storage Engineer

Senior DGX Cloud Software Engineer- Infrastructure Automation and Distributed Systems

Senior Cloud Engineer role at NVIDIA focusing on infrastructure automation and distributed systems for DGX cloud services.

Senior Software Engineer, Bare Metal Automation - DGX Cloud

Senior Software Engineer position at NVIDIA focusing on bare metal automation for DGX Cloud, managing GPU clusters and implementing monitoring systems for AI infrastructure.

Senior Cloud Platform Software Engineer

Senior Cloud Platform Engineer role at NVIDIA building scalable cloud services for AI workloads, requiring 12+ years of experience in platform engineering and expertise in Kubernetes.

Senior Software Engineer, Kubernetes - DGX Cloud

Senior Kubernetes Engineer role at NVIDIA focusing on scaling AI infrastructure through cloud computing, offering competitive compensation and opportunity to work with cutting-edge GPU technology.

Senior Software Engineer, Reliability and Operational Excellence - DGX Cloud

Senior Software Engineer position focused on reliability and operational excellence for NVIDIA's DGX Cloud platform.