Google's Core Machine Learning team is seeking a Principal Engineer to lead the Machine Learning strategy for the Borg Control Plane team. This role is crucial in shaping the capabilities and infrastructure necessary to advance Google's ML roadmap. The position involves working with Google's cutting-edge AI platforms, including TensorFlow, JAX, and TPU systems, while collaborating with various teams across platform, storage, data center, networking, and resource management.
The role requires a seasoned professional with extensive experience in distributed systems and ML infrastructure. You'll be responsible for driving new capabilities and supporting the growth and efficient usage of Google's fleet. A key aspect of the position involves partnering with leads from various Google product areas, including DeepMind, Search, Ads, and YouTube, to accelerate the transition of research innovations to production.
As a Principal Engineer, you'll be working at the intersection of infrastructure and machine learning, helping to deliver GPUs and Google's advanced internal technology (TPUs) to external customers via Google Cloud Platform. The position offers a competitive compensation package, including a base salary range of $294,000-$414,000, plus bonus, equity, and comprehensive benefits.
The ideal candidate will have at least 15 years of software engineering experience (or 13 with an advanced degree), with a strong background in large-scale distributed systems and ML infrastructure. You should be comfortable leading complex programs, defining strategic roadmaps, and working cross-functionally with various stakeholders. This role presents an exceptional opportunity to impact Google's AI infrastructure and shape the future of machine learning capabilities at one of the world's leading technology companies.