Google Compute Engine (GCE) is the foundation of Google Cloud Platform, providing Infrastructure as a Service to customers. As a Staff Software Engineer on the GCE Telemetry team, you'll play a crucial role in providing customers with tools and information necessary to understand and improve compute instance observability. The team collects and stores large-scale, real-time VM Telemetry data from the entire GCE fleet.
You'll be working at the intersection of cloud infrastructure and machine learning, leading technical initiatives that improve Google Cloud's observability and reliability. This role requires deep expertise in ML infrastructure, data analysis, and system design, combined with strong leadership skills to mentor team members and collaborate across functions.
The position offers competitive compensation including base salary, bonus, equity, and comprehensive benefits. You'll be part of Google Cloud's mission to accelerate digital transformation across industries, working with cutting-edge technology that serves customers in over 200 countries.
This role is perfect for experienced engineers who are passionate about large-scale systems, machine learning, and data-driven decision making. You'll have the opportunity to shape the future of cloud infrastructure while working with some of the industry's best engineers and contributing to Google's culture of innovation and technical excellence.
The ideal candidate will bring both technical depth in ML/AI and the leadership skills necessary to drive cross-functional initiatives. You'll be responsible for building and executing the technical roadmap for GCE fleet observability, establishing data best practices, and ensuring the reliability of critical cloud infrastructure used by businesses worldwide.