Software Engineer III, Cloud Machine Learning Compute Platforms

Google Cloud provides enterprise-grade solutions leveraging cutting-edge technology and tools for digital transformation.
$136,000 - $200,000
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI · Enterprise SaaS · Cloud

Description For Software Engineer III, Cloud Machine Learning Compute Platforms

Google Cloud is seeking a Software Engineer III to join its Cloud Machine Learning Compute Service Platforms and Experience team. This role focuses on developing and maintaining the service platform for Machine Learning Computing on Google Cloud, including Tensor Processing Units (TPUs). The position offers an opportunity to work on cutting-edge technology that accelerates machine learning workloads and reduces computational costs.

As a Software Engineer III, you'll be responsible for building and improving the accelerator platform for TPUs, designing and implementing GCP API infrastructure, and working across various stack layers. The role requires collaboration with cross-functional teams and managing project schedules while identifying technical risks.

The ideal candidate should have experience in software development, distributed systems, and API design. Knowledge of AI/ML systems and cloud platforms is highly valued. The position offers competitive compensation ($136,000-$200,000) plus bonus, equity, and benefits.

Google Cloud serves customers in over 200 countries, providing enterprise-grade solutions that leverage Google's cutting-edge technology. The company is committed to diversity, equality, and creating a culture of belonging. This role offers the opportunity to work on technologies that impact billions of users while contributing to the next generation of cloud computing and machine learning infrastructure.

Working at Google Cloud means being part of a team that's pushing the boundaries of technology, particularly in machine learning and cloud computing. You'll have the chance to work with state-of-the-art TPU technology, contribute to large-scale systems, and help shape the future of cloud-based machine learning platforms.

Responsibilities For Software Engineer III, Cloud Machine Learning Compute Platforms

  • Build and improve the accelerator platform where the team onboards first-party machine learning TPUs to Google Cloud Platform (GCP)
  • Design and implement features and scale the Cloud TPU GCP API infrastructure to support multiple generations of TPU hardware and products
  • Work with software engineers, product managers, technical program managers, and other engineering teams to get high-quality products and features through the software project life-cycle
  • Impact various layers of the stack, including the Cloud front-end, control plane, data plane, storage, and GCP horizontals, as well as system scalability and performance improvements
  • Manage project schedules, identify technical risks, and communicate them to project stakeholders

Requirements For Software Engineer III, Cloud Machine Learning Compute Platforms

Python
Java
Go
Kubernetes
  • Bachelor's degree or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree
  • 2 years of experience developing infrastructure, distributed systems, or networks, or experience with compute technologies, storage or hardware architecture
  • Experience in API design and development

Benefits For Software Engineer III, Cloud Machine Learning Compute Platforms

  • Medical Insurance
  • Vision Insurance
  • Dental Insurance
  • Equity
  • 401k
