Principal Engineer, Platform and Scale, Vertex AI

Google Cloud is a trusted partner providing enterprise-grade solutions leveraging cutting-edge technology across 200+ countries.
Machine Learning
Principal Software Engineer
In-Person
5,000+ Employees
15+ years of experience
AI · Enterprise SaaS · Cloud

Description For Principal Engineer, Platform and Scale, Vertex AI

Google Cloud is seeking a Principal Engineer to lead the Vertex Generative AI Serving platform, a crucial initiative that will shape AI experiences across Google Cloud's global user base. This role combines technical leadership with strategic vision, requiring 15 years of engineering leadership experience and deep expertise in generative AI technologies.

The position involves leading a multi-site engineering organization responsible for the performance, reliability, and efficiency of production AI models. You'll collaborate with Vertex platform teams, Google research, and CoreML teams to build and evolve the LM serving platform. The role demands both technical excellence and leadership skills, as you'll be setting strategic direction, making critical technology choices, and driving platform adoption across Google.

As a Principal Engineer, you'll be at the forefront of Google Cloud's mission to accelerate digital transformation across industries. You'll work with cutting-edge AI technology, influencing how enterprises worldwide implement and benefit from generative AI solutions. The position offers the opportunity to shape the future of AI serving infrastructure while working with some of the industry's best minds.

The ideal candidate combines deep technical knowledge with strong leadership abilities, capable of inspiring teams, driving engineering excellence, and translating complex developer needs into exceptional software solutions. You'll be joining a company committed to innovation and inclusion, with the chance to impact how AI technology is deployed and used across the globe.

This role represents a unique opportunity to lead transformative AI initiatives at one of the world's leading technology companies, working on problems that affect millions of users and shape the future of cloud computing and artificial intelligence.

Last updated 3 days ago

Responsibilities For Principal Engineer, Platform and Scale, Vertex AI

  • Own the strategic roadmap, resource allocation, execution, and delivery, and make the critical technology choices that drive them
  • Lead a growing engineering organization across multiple sites
  • Lead, collaborate with, and influence the broader Vertex platform, Google research, and CoreML teams to build the LM serving platform
  • Evolve the strategic direction for the organization as the set of challenges and opportunities for Google rapidly changes
  • Provide executive leadership to define the product vision, roadmap, and execution strategy
  • Influence key stakeholders across the company, advocating for resources and driving adoption of the platform

Requirements For Principal Engineer, Platform and Scale, Vertex AI

Python
Java
Kubernetes
  • Bachelor's degree in Computer Science or a similar technical field of study, or equivalent practical experience
  • 15 years of experience in software engineering leadership roles and building developer-facing products or platforms
  • Experience with generative AI (LLMs, diffusion models, etc.), development workflows, and use cases
  • Experience in technical leadership, leading global projects and setting technical direction for teams
  • Experience driving engineering excellence through testing, monitoring, and continuous improvement
  • Strong understanding of Agile methodologies and software development lifecycle
  • Understanding of developer needs
  • Proven and dynamic leader with the ability to set a vision and inspire those around them

