Senior DevOps Engineer

NVIDIA is the world leader in accelerated computing, pioneering solutions in AI and digital twins.
DevOps
Senior Software Engineer
In-Person
8+ years of experience
AI · Enterprise SaaS

Description For Senior DevOps Engineer

NVIDIA, the world leader in accelerated computing, is seeking a Senior DevOps Engineer to join their Farm team. This role focuses on improving growing services infrastructure and requires a passionate individual dedicated to operational excellence. The position involves working with a diverse team of skilled engineers on critical infrastructure management and automation tasks.

The role combines hands-on technical work with strategic infrastructure planning, requiring expertise in multiple programming languages, cloud services, and modern DevOps practices. You'll be responsible for maintaining high-performance computing environments, implementing monitoring solutions, and ensuring system reliability.

Key technical areas include Linux systems, container orchestration, cloud platforms, and automation tools. The ideal candidate will have strong experience with CI/CD pipelines, infrastructure as code, and modern monitoring solutions like Grafana and Prometheus.

This position offers the opportunity to work on large-scale systems at a company at the forefront of AI and accelerated computing. You'll be part of a team that values continuous improvement and innovation, with chances to work on cutting-edge technology and contribute to NVIDIA's mission of transforming major industries through AI and digital twins.

Last updated 7 days ago

Responsibilities For Senior DevOps Engineer

  • Own services and work with cross-functional teams
  • Perform frequent code testing and deployment
  • Improve infrastructure provisioning and management using automation
  • Identify areas to improve service resiliency
  • Support globally distributed on-prem environment (LSF)
  • Determine root-cause for production incidents and write RCA reports
  • Ensure highest level of up-time and Quality of Service
  • Participate in team's on-call rotation

Requirements For Senior DevOps Engineer

Python
Go
Linux
Kubernetes
  • B.S. degree in Computer Science or related technical field or equivalent experience
  • 8+ years coding/scripting in high level programming languages
  • Experience with web applications, databases, APIs, and cloud platforms
  • Knowledge of operating services including web servers, load balancers, databases
  • Deep understanding of Linux operating system and TCP/IP fundamentals
  • Experience with cloud services (AWS, GCP, Azure)
  • Proficiency in monitoring tools like Grafana and Prometheus
  • Expertise in CI/CD, GitOps and Infrastructure as Code
  • Strong communication and documentation skills

Interested in this job?

Jobs Related To NVIDIA Senior DevOps Engineer

Senior HPC DevOps Engineer

Senior HPC DevOps Engineer role at NVIDIA focusing on building and maintaining large-scale supercomputers and HPC clusters for AI and GPU computing advancement.

Senior DevOps and Automation Engineer, Fabric Networking - GPU

Senior DevOps role at NVIDIA focusing on GPU cluster management, automation, and infrastructure development for high-performance computing systems.

Senior CUDA Driver, Legate, and Build Engineer

Senior DevOps role at NVIDIA focusing on CUDA driver development and build system automation, offering competitive compensation and opportunity to work with cutting-edge technology.

Senior Enterprise Software Test Development Engineer

Senior Enterprise Software Test Development Engineer role at NVIDIA, focusing on automation, DevOps, and quality assurance for enterprise server platforms.

Senior Release Engineer - Server Software

Senior Release Engineer position at NVIDIA focusing on server software release management and automation.