Google Cloud is seeking a Software Engineer specializing in ML/AI Reference Models to join their Technical Infrastructure team. This role focuses on shaping the future of AI/ML hardware acceleration, particularly working with TPU (Tensor Processing Unit) technology that powers Google's most demanding AI/ML applications. The position involves developing functional and performance models for ML compute IPs and integrating them with the Cloud TPU SoC model.
As part of a diverse team, you'll be pushing boundaries in custom silicon solutions that power Google's TPU. Your responsibilities will include working closely with ML and SoC architecture teams to understand instruction sets and architecture of ML IPs in detail. You'll collaborate with pre-silicon, post-silicon, and software teams to implement these models in their validation flows, contributing to the delivery of high-quality designs for next-generation data center accelerators.
The role is based in Google's Technical Infrastructure team, which is fundamental to maintaining and developing Google's data centers and platforms. This team takes pride in being the engineers' engineers, focusing on keeping networks running optimally and ensuring the best possible user experience.
This position offers an exciting opportunity to work at the intersection of hardware and machine learning, contributing to cutting-edge technology that powers Google's AI applications. The ideal candidate will combine technical expertise in hardware modeling with a strong understanding of machine learning architectures, making this role perfect for someone passionate about advancing AI hardware acceleration technology.
Working at Google also means being part of a company that values diversity, equality, and inclusion, with a strong commitment to building a representative workforce and creating a culture of belonging. The role offers the chance to work on impactful projects while being part of a supportive and innovative team environment.