Senior AI-HPC Storage Engineer

World leader in accelerated computing, pioneering AI and digital twins technology.
$184,000 - $356,500
Distributed Systems
Senior Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI · Enterprise SaaS

Description For Senior AI-HPC Storage Engineer

NVIDIA, a global leader in accelerated computing and AI technology, is seeking a Senior AI-HPC Storage Engineer to join their GPU AI/HPC Infrastructure team. This role presents an exciting opportunity to work at the forefront of storage solutions for deep learning and high-performance computing workloads.

The position involves designing and implementing groundbreaking fast storage solutions that enable demanding deep learning and HPC workloads. As a senior engineer, you'll be responsible for identifying architectural changes across file, block, and object storage systems, while helping to solve strategic challenges in storage design for large-scale, high-performance workloads.

The ideal candidate brings 8+ years of experience in large-scale storage infrastructure and deep expertise in parallel/distributed filesystems. You'll work with cutting-edge technologies, including AI/HPC clusters, container platforms, and cloud infrastructure. The role offers exposure to NVIDIA's innovative work in AI and digital twins, which is transforming major industries.

NVIDIA offers a highly competitive salary range of $184,000 to $356,500, along with equity and comprehensive benefits. You'll be joining a company known for continuous innovation, having evolved from inventing the GPU to revolutionizing parallel computing and igniting the modern AI era. The position provides an opportunity to work with some of the most talented people in the industry while solving complex technical challenges that matter to the world.

The role is available in prime tech hubs including Santa Clara, Westford, and Austin, offering flexibility in location while working on-site with state-of-the-art infrastructure. This is an excellent opportunity for someone passionate about storage systems and high-performance computing to make a significant impact at a company that's driving the future of technology.

Last updated a month ago

Responsibilities For Senior AI-HPC Storage Engineer

  • Research and implement distributed storage services
  • Design and implement on-prem AI/HPC infrastructure with cloud computing
  • Design scalable and efficient next-gen storage solutions
  • Develop tooling for automation and management of large-scale infrastructure
  • Document procedures and practices for distributed file systems
  • Collaborate across teams to understand developer workflows
  • Guide methodologies for building, testing, and deploying applications
  • Support researchers with performance analysis and optimizations
  • Perform root cause analysis and suggest corrective actions

Requirements For Senior AI-HPC Storage Engineer

Python
Linux
Kubernetes
  • Bachelor's degree in Computer Science, Electrical Engineering or related field
  • 8+ years of experience in large scale storage infrastructure
  • Experience with parallel/distributed filesystems (Lustre, GPFS)
  • Proficiency in Centos/RHEL/Ubuntu Linux
  • Python programming and bash scripting skills
  • Experience with cloud storage solutions (AWS, Azure, GCP)
  • Experience with AI/HPC cluster job schedulers
  • Understanding of container technologies
  • Experience with AI/HPC workflows using MPI

Benefits For Senior AI-HPC Storage Engineer

Equity
  • Equity
  • Benefits Package

Interested in this job?

Jobs Related To NVIDIA Senior AI-HPC Storage Engineer

Senior Software Engineer, GPU Communications and Networking

Senior Software Engineer role at NVIDIA focusing on GPU Communications and Networking, developing high-performance computing systems and deep learning frameworks.

Senior Software Engineer - HPC

Senior Software Engineer position at NVIDIA focusing on HPC infrastructure, requiring 10+ years of experience in distributed systems and cloud computing.

Systems Engineer, Enterprise

Senior Systems Engineer position at NVIDIA focusing on enterprise HPC server deployment, requiring 6+ years experience and strong hardware/software expertise.

Senior System Software Engineer, Distributed Systems - DGX Cloud

Senior System Software Engineer position at NVIDIA focusing on distributed systems and DGX Cloud infrastructure.

Senior System Software Engineer, Metropolis

Senior System Software Engineer role at NVIDIA Metropolis division, focusing on scalable Digital Twin and Synthetic Data Generation solutions with competitive compensation.