NVIDIA is seeking a passionate, motivated and technical Kubernetes Architect/Engineer to join its Infrastructure, Planning and Processes organization as a Principal DevOps & SRE Engineer. You'll support the design and implementation of Kubernetes solutions for the company's Cloud Platform.
You'll be part of a fast-paced team developing and maintaining sophisticated build & test environments for NVIDIA GPUs, Tegra Processors, and various operating systems. The role involves working with different business units within NVIDIA Software, including Graphics Processors, Mobile Processors, Deep Learning, Artificial Intelligence, Robotics, and Autonomous cars.
Key Responsibilities:
- Architect, design, implement & maintain Kubernetes environments for CI/CD pipeline
- Design solutions for service discovery, networking, monitoring, logging, and scheduling in Kubernetes
- Ensure platform reliability, scalability, and resistance to disruptions
- Participate in product workshops, roadmap, and design sessions
- Lead technical demos and working sessions
- Defend architectural designs in front of the DevSecOps review board
- Develop automations to improve efficiency & productivity
- Participate in on-call support and critical issue coverage as an SRE engineer
- Prototype and develop cloud infrastructure for NVIDIA
Requirements:
- Kubernetes domain expertise with experience in building scalable, resilient platforms
- High proficiency in administering and configuring Kubernetes
- Programming background in Python or similar scripting languages
- Experience maintaining cloud infrastructure and highly available production environments
- Proficiency in CI/CD tools and infrastructure-as-code (ansible, puppet, chef & terraform)
- Experience with databases (SQL and NoSQL) and monitoring tools
- 8+ years of proven experience
- Bachelor's or Master's degree in Computer Science, Software Engineering, or equivalent experience
Preferred Qualifications:
- Kubernetes certifications (CKA, CKS, CKAD)
- Understanding of containerization and microservices architecture
- Experience with large-scale operations teams and data centers
- Strong problem-solving and system design skills
NVIDIA offers competitive salaries, generous benefits, and is considered one of the most desirable employers in the technology world. Join our team of forward-thinking and hardworking professionals in this rapidly growing field.