NVIDIA, a pioneer in Accelerated Computing, is seeking a Senior Site Reliability Engineer for their GPU Cloud team. This role is part of a fast-paced SRE team managing cloud and on-prem infrastructure for High-Performance & Distributed Computing. The NVIDIA GPU cloud is a hosted platform for internal R&D teams and external AI/ML stack customers, spanning thousands of GPU nodes.
As a Senior SRE, you will:
Key requirements:
NVIDIA offers a diverse work environment and is an equal opportunity employer. They do not discriminate based on race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other protected characteristic.
Join NVIDIA to work at the forefront of AI, autonomous vehicles, robotics, HPC, gaming/visualization, and cloud computing, contributing to breakthrough technologies that are transforming industries and society.