Google is seeking a Technical Program Manager for Machine Learning Operations and Maintenance to join their Central Operations team. This role involves managing complex, multi-disciplinary projects related to data center operations, with a focus on Machine Learning workload dependencies, maintenance policies, and global strategies for shutdown/turnaround maintenance.
Key responsibilities include:
- Documenting ML workload dependencies on power and cooling infrastructure
- Developing and implementing Maintenance SLO policies for Data Center Operations
- Creating a global strategy for shutdown/turnaround maintenance
- Implementing a planned downtime communications solution for internal and external Cloud customers
- Collaborating with partner teams to implement programmatic changes in various processes
The ideal candidate will have:
- A Bachelor's degree in a relevant field or equivalent practical experience
- 8+ years of experience in critical operations, global change management, or technical program management
- Experience managing multiple vendors and external partners in a 24x7 environment
- Knowledge of electrical/power and mechanical/cooling engineering
- Experience with global change governance and maintenance in data centers
- Strong problem-solving and data analytics skills
- Ability to travel 40-50% of the time as needed
Google offers a competitive salary range of $168,000-$252,000 plus bonus, equity, and benefits. They are committed to diversity, equity, and inclusion, aiming to build a workforce that represents the users they serve. This role provides an opportunity to work on cutting-edge technology and contribute to Google Cloud's mission of accelerating digital transformation for organizations worldwide.