Staff Software Engineer, Google Kubernetes Engine AI Training

Google Cloud delivers enterprise-grade solutions leveraging cutting-edge technology and tools for developers, serving customers in more than 200 countries.
Machine Learning
Staff Software Engineer
In-Person
8+ years of experience
AI · Enterprise SaaS · Cloud

Description For Staff Software Engineer, Google Kubernetes Engine AI Training

Google Cloud is seeking a Staff Software Engineer to lead the evolution of Google Kubernetes Engine for AI Training. This role combines deep technical expertise in distributed systems, AI/ML, and cloud infrastructure with technical leadership responsibilities. You'll work at the cutting edge of cloud technology, helping shape how billions of users interact with Google's services.

As a technical leader, you'll drive high-impact projects that define the future of AI model training on Kubernetes. You'll be responsible for designing and implementing large-scale software solutions that power Google Cloud's AI infrastructure. The role requires both technical depth in AI frameworks and tools (like TensorFlow, PyTorch, and Kubeflow) and the ability to lead and mentor engineering teams.

The position offers the opportunity to work with world-class engineers, tackle complex technical challenges in distributed computing and AI, and directly impact how organizations worldwide leverage Google Cloud for their AI initiatives. You'll be part of Google Cloud's mission to accelerate digital transformation across industries, working with cutting-edge technology in a collaborative, innovation-driven environment.

This role is perfect for experienced engineers who are passionate about AI infrastructure, have a proven track record of technical leadership, and want to shape the future of cloud computing. You'll have the chance to work on projects that directly influence how the world's most demanding AI models are trained and deployed at scale.

Last updated a month ago

Responsibilities For Staff Software Engineer, Google Kubernetes Engine AI Training

  • Provide technical leadership on high-impact projects
  • Facilitate alignment and clarity across teams on goals, outcomes, and timelines
  • Design, develop, test, deploy, maintain, and enhance large-scale software solutions
  • Drive the technical vision and roadmap for Kubernetes evolution to meet the needs of GenAI models training
  • Inspire, mentor and support a talented team of engineers

Requirements For Staff Software Engineer, Google Kubernetes Engine AI Training

Kubernetes
Python
  • Bachelor's degree or equivalent practical experience
  • 8 years of experience in software development, and with data structures/algorithms
  • 5 years of experience testing, and launching software products
  • 3 years of experience with software design and architecture
  • Experience with AI frameworks (e.g., Tensorflow, PyTorch, JAX), algorithms and tools(e.g., Kubeflow, Ray.io, MLflow)
  • Master's degree or PhD in Engineering, Computer Science, or related technical field (preferred)
  • 3 years of experience in technical leadership role (preferred)
  • 3 years of experience working in complex, matrixed organizations (preferred)
  • MLOps experience with large-scale distributed environments (preferred)

Interested in this job?

Jobs Related To Google Staff Software Engineer, Google Kubernetes Engine AI Training

Senior Research Scientist, Interactive Recommender Systems

Senior Research Scientist position at Google Research focusing on interactive recommender systems, machine learning, and AI, offering competitive compensation and benefits.

Staff Research Scientist, Google Cloud AI

Lead AI research scientist position at Google Cloud, focusing on advancing AI technology and its applications across industries while contributing to the research community.

Staff Software Developer, Generative AI, Gemini Code Assist

Lead the development of AI-powered developer tools at Google's Gemini Code Assist team, focusing on machine learning and generative AI applications.

Product Manager, AI/ML, Google Cloud

Lead AI/ML product management at Google Cloud, developing strategic vision for ML hardware stack and collaborating with teams like DeepMind and YouTube.

Senior Research Scientist, Multilingual NLP

Senior Research Scientist position at Google focusing on multilingual NLP and LLMs, requiring PhD and 7+ years of experience in machine learning and natural language processing.