Staff Software Engineer, Google Kubernetes Engine AI Training

Google Cloud delivers enterprise-grade solutions leveraging cutting-edge technology and tools for developers, serving customers in more than 200 countries.
Machine Learning
Staff Software Engineer
In-Person
8+ years of experience
AI · Enterprise SaaS · Cloud

Description For Staff Software Engineer, Google Kubernetes Engine AI Training

Google Cloud is seeking a Staff Software Engineer to lead the evolution of Google Kubernetes Engine for AI Training. This role combines deep technical expertise in distributed systems, AI/ML, and cloud infrastructure with technical leadership responsibilities. You'll work at the cutting edge of cloud technology, helping shape how billions of users interact with Google's services.

As a technical leader, you'll drive high-impact projects that are critical to Google Cloud's strategy in the AI/ML space. You'll be responsible for designing and implementing large-scale software solutions that power the training of the world's most demanding GenAI models. The role requires both technical depth in AI frameworks and tools, as well as the ability to lead and mentor teams.

The position offers the opportunity to work with cutting-edge technology in AI and cloud computing, while being part of Google's influential cloud division that serves customers across 200+ countries. You'll collaborate with talented engineers across Google, helping shape the future of cloud computing and AI infrastructure.

This role is perfect for experienced engineers who are passionate about AI/ML infrastructure, have a strong background in distributed systems, and want to make a significant impact on how the world's most advanced AI models are trained and deployed. You'll be at the forefront of solving complex technical challenges while helping build and mentor high-performing engineering teams.

Last updated 3 months ago

Responsibilities For Staff Software Engineer, Google Kubernetes Engine AI Training

  • Provide technical leadership on high-impact projects
  • Facilitate alignment and clarity across teams on goals, outcomes, and timelines
  • Design, develop, test, deploy, maintain, and enhance large-scale software solutions
  • Drive the technical vision and roadmap for Kubernetes evolution to meet the needs of GenAI models training
  • Inspire, mentor and support a talented team of engineers

Requirements For Staff Software Engineer, Google Kubernetes Engine AI Training

Kubernetes
Python
  • Bachelor's degree or equivalent practical experience
  • 8 years of experience in software development, and with data structures/algorithms
  • 5 years of experience testing, and launching software products
  • 3 years of experience with software design and architecture
  • Experience with AI frameworks (e.g., Tensorflow, PyTorch, JAX), algorithms and tools(e.g., Kubeflow, Ray.io, MLflow)
  • Master's degree or PhD in Engineering, Computer Science, or related technical field (preferred)
  • 3 years of experience in technical leadership role (preferred)
  • 3 years of experience working in complex, matrixed organizations (preferred)
  • MLOps experience with large-scale distributed environments (preferred)

Interested in this job?

Jobs Related To Google Staff Software Engineer, Google Kubernetes Engine AI Training

Senior Research Scientist, Interactive Recommender Systems

Senior Research Scientist position at Google focusing on interactive recommender systems and machine learning research.

Staff Research Scientist, Google Cloud AI

Staff Research Scientist position at Google Cloud AI, focusing on advanced AI research and development with competitive compensation and benefits.

Senior Technical Program Manager II, Machine Learning, Google Cloud

Senior Technical Program Manager position at Google Cloud, focusing on Machine Learning initiatives with 10+ years of experience required.

Senior Research Scientist, Google Cloud AI

Senior Research Scientist position at Google Cloud AI, focusing on advancing AI technology through research and practical applications across various industries.

Field Solution Architect III, AI Infrastructure, West, Google Cloud

Senior technical role focusing on AI infrastructure architecture and customer solutions at Google Cloud, combining ML expertise with cloud infrastructure knowledge.