Senior Software Engineer, TPU Supercomputer

Google is a global technology company that builds technical infrastructure to power its product portfolio, including data centers and next-generation platforms.
Distributed Systems
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Description For Senior Software Engineer, TPU Supercomputer

Google is seeking a Senior Software Engineer to join its Technical Infrastructure team, focusing on TPU Supercomputer systems. This role is at the heart of Google's AI infrastructure, working on the architecture that powers its large-scale computational needs. The position involves designing and maintaining software systems for TPU supercomputers and managing both the computing and networking components of AI supercomputing systems.

The ideal candidate will bring strong expertise in C++ system software development and distributed systems, with the ability to work across different layers of the software stack. This role offers the opportunity to work with cutting-edge technology in AI and supercomputing, collaborating with various specialized teams including Silicon, Software, SRE, and Operations.

As a Senior Software Engineer in this role, you'll be responsible for creating system-level tools for debugging and observability, ensuring the smooth operation of Google's AI infrastructure. The position requires experience with distributed systems; familiarity with cloud platforms and machine learning concepts is preferred. It is an exciting opportunity for someone passionate about large-scale computing systems.

Working at Google means joining a company that values diversity, equality, and inclusion, with a strong commitment to building a representative workforce. The role offers the chance to work on technology that impacts billions of users while being part of a team that takes pride in being "engineers' engineers." If you're excited about pushing the boundaries of AI computing infrastructure and working with some of the most advanced supercomputing systems in the world, this role presents an exceptional opportunity for career growth and technical innovation.

Responsibilities For Senior Software Engineer, TPU Supercomputer

  • Design and maintain TPU supercomputer software across different layers of the software stack
  • Manage the whole lifecycle across both computing and networking components for Google's AI supercomputers/hypercomputers
  • Create system-level debuggability and observability tools in partnership with key stakeholders
  • Collaborate with Silicon, Software, Site Reliability Engineering (SRE), and Operations teams to drive reliability improvements

Requirements For Senior Software Engineer, TPU Supercomputer

  • Bachelor's degree or equivalent practical experience
  • 5 years of experience in system software development with C++
  • 3 years of experience in distributed systems
  • Master's degree or PhD in Computer Science or related technical field (preferred)
  • Experience with production monitoring, logging, and observability tools (preferred)
  • Experience with cloud platforms and technologies (e.g., GCP) (preferred)
  • Experience with machine learning concepts and frameworks (e.g., TensorFlow) (preferred)
  • Experience with data analysis and SQL (preferred)
  • Knowledge of networking protocols and technologies (preferred)

Benefits For Senior Software Engineer, TPU Supercomputer

  • Medical Insurance
  • Vision Insurance
  • Dental Insurance
  • Parental Leave
  • Equal opportunity employer
  • Accommodation for disabilities
  • Parental benefits including breastfeeding support
