Software Engineer III, Cloud Machine Learning Compute Platforms

Google Cloud provides enterprise-grade solutions leveraging cutting-edge technology and tools for digital transformation.
$136,000 - $200,000
Machine Learning
Mid-Level Software Engineer
In-Person
5,000+ Employees
2+ years of experience
AI · Enterprise SaaS · Cloud

Description For Software Engineer III, Cloud Machine Learning Compute Platforms

Google Cloud is seeking a Software Engineer III to join its Cloud Machine Learning Compute Service Platforms and Experience team. The role focuses on developing and maintaining the service platform for machine learning compute on Google Cloud, including Tensor Processing Units (TPUs). The team's work accelerates machine learning workloads and reduces computational costs.

As a Software Engineer III, you'll be responsible for building and improving the accelerator platform for TPUs, designing and implementing GCP API infrastructure, and working across various stack layers. The role requires collaboration with cross-functional teams and managing project schedules while identifying technical risks.

The ideal candidate has experience in software development, distributed systems, and API design. Knowledge of AI/ML systems and cloud platforms is highly valued. Compensation is $136,000-$200,000 plus bonus, equity, and benefits, and the work involves technology that impacts billions of users.

Google Cloud accelerates organizations' digital transformation across industries, serving customers in more than 200 countries. The company is committed to diversity, equality, and creating a culture of belonging. This role provides an excellent opportunity to work on next-generation technologies while contributing to Google Cloud's mission of enabling growth and solving critical business problems for customers worldwide.

Join a team that's pushing the boundaries of machine learning infrastructure and help shape the future of cloud computing at one of the world's most innovative companies. You'll have the chance to work with state-of-the-art TPU technology while collaborating with talented engineers and researchers in a dynamic, fast-paced environment.

Last updated a month ago

Responsibilities For Software Engineer III, Cloud Machine Learning Compute Platforms

  • Build and improve the accelerator platform where the team onboards first-party machine learning TPUs to Google Cloud Platform (GCP)
  • Design and implement features and scale the Cloud TPU GCP API infrastructure to support multiple generations of TPU hardware and products
  • Work with software engineers, product managers, technical program managers, and other engineering teams to get high-quality products and features through the software project life-cycle
  • Contribute across layers of the stack, including the Cloud front-end, control plane, data plane, storage, and GCP horizontals, as well as system scalability and performance improvements
  • Manage project schedules, identify technical risks, and communicate them to project stakeholders

Requirements For Software Engineer III, Cloud Machine Learning Compute Platforms

Python · Java · Go
  • Bachelor's degree or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree
  • 2 years of experience developing infrastructure, distributed systems, or networks, or experience with compute technologies, storage or hardware architecture
  • Experience in API design and development

Benefits For Software Engineer III, Cloud Machine Learning Compute Platforms

Medical Insurance · Dental Insurance · Vision Insurance · 401(k) · Equity
  • Competitive base salary
  • Bonus
  • Equity
  • Comprehensive benefits package

