Software Engineer III, Cloud Machine Learning Compute Platforms

Google Cloud provides enterprise-grade solutions leveraging cutting-edge technology and tools for digital transformation.
$136,000 - $200,000
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI · Enterprise SaaS · Cloud

Description For Software Engineer III, Cloud Machine Learning Compute Platforms

Google Cloud is seeking a Software Engineer III to join its Cloud Machine Learning Compute Service Platforms and Experience team. The role focuses on developing and maintaining the service platform for machine learning computing on Google Cloud, including Tensor Processing Units (TPUs). It is an opportunity to work on technology that accelerates machine learning workloads and reduces computational costs.

As a Software Engineer III, you'll build and improve the accelerator platform for TPUs, design and implement GCP API infrastructure, and work across layers of the stack, including the Cloud front-end, control plane, data plane, and storage. The role also involves collaborating with cross-functional teams, managing project schedules, and identifying technical risks.

The ideal candidate has experience in software development, distributed systems, and API design. Knowledge of AI/ML systems and cloud platforms is highly valued. The position offers competitive compensation ($136,000-$200,000) plus bonus, equity, and comprehensive benefits.

Google Cloud serves customers in over 200 countries, providing enterprise-grade solutions that leverage Google's cutting-edge technology. The company is committed to diversity, equality, and creating a culture of belonging. This role offers the opportunity to work on technologies that impact billions of users while contributing to the next generation of cloud computing and machine learning infrastructure.

Working at Google Cloud means being part of a team that's pushing the boundaries of technology, with opportunities to switch teams and projects as you grow. You'll be empowered to innovate and act like an owner, anticipating customer needs and taking action to drive technology forward.

Last updated 3 months ago

Responsibilities For Software Engineer III, Cloud Machine Learning Compute Platforms

  • Build and improve the accelerator platform where the team onboards first-party machine learning TPUs to Google Cloud Platform (GCP)
  • Design and implement features and scale the Cloud TPU GCP API infrastructure to support multiple generations of TPU hardware and products
  • Work with software engineers, product managers, technical program managers, and other engineering teams to take high-quality products and features through the software project lifecycle
  • Impact various layers of the stack including the Cloud front-end, control plane, data plane, storage, GCP horizontals, as well as system scalability and performance improvements
  • Manage project schedules, identify technical risks, and communicate them to project stakeholders

Requirements For Software Engineer III, Cloud Machine Learning Compute Platforms

Python · Java · Go
  • Bachelor's degree or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree
  • 2 years of experience developing infrastructure, distributed systems, or networks, or experience with compute technologies, storage or hardware architecture
  • Experience in API design and development

Benefits For Software Engineer III, Cloud Machine Learning Compute Platforms

Medical Insurance · Dental Insurance · Vision Insurance · Equity · Parental Leave
  • Competitive base salary
  • Bonus
  • Equity
  • Comprehensive benefits package
  • Parental benefits

Jobs Related To Google Software Engineer III, Cloud Machine Learning Compute Platforms

Research Scientist, Market Algorithms, Google Research

Research Scientist position at Google Research focusing on market algorithms, combining economics, ML, and algorithm design to develop efficient marketplaces.

Field Solution Architect II, AI Infrastructure, North, Google Cloud

Field Solution Architect II position at Google Cloud, focusing on AI infrastructure implementation and customer advisory for enterprise AI solutions.

Technical Program Manager III, Hardware and Software Systems, Google Cloud

Technical Program Manager III position at Google Cloud, leading hardware and software systems initiatives with focus on machine learning and AI development.

Field Solution Architect II, AI Infrastructure, South, Google Cloud

Enterprise AI Infrastructure Field Solution Architect position at Google Cloud, focusing on implementing AI accelerator solutions and helping customers optimize their cloud infrastructure.

Research Scientist

Research Scientist position at Google Research, focusing on machine learning and AI development with competitive compensation and opportunities for publication.