Google is seeking a Software Engineer III to join their Technical Infrastructure team, specifically focusing on GPU Accelerator development for the Google Cloud Platform. This role is crucial in maintaining and enhancing the core software that runs on Google's massive production fleet, supporting the latest hardware advancements from CPUs and GPUs to custom-designed TPUs.
The position requires strong expertise in system software integration and development, particularly for next-generation GPU accelerators in Google's data centers. You'll be working with kernel drivers, firmware development, and system architecture, requiring both deep technical knowledge and innovative problem-solving skills.
As a member of the Google System (gSys) team, you'll be at the forefront of developing and maintaining the infrastructure that powers Google's extensive product portfolio. The role combines low-level system programming with high-impact infrastructure development, making it perfect for engineers passionate about system architecture and performance optimization.
Key responsibilities include driving system software integration, developing kernel drivers, writing detailed specifications, and creating comprehensive test suites. You'll also work closely with vendors, influencing their solutions to better integrate with Google's ecosystem.
The ideal candidate should have at least 2 years of experience in C/C++ programming, with additional expertise in Python and Linux kernel development being highly valuable. This role offers the opportunity to work on cutting-edge technology at massive scale, making a direct impact on Google's infrastructure that serves billions of users worldwide.
Google offers a collaborative environment where engineers can grow and tackle complex technical challenges. The position provides the chance to work with advanced hardware technologies and contribute to the foundation of Google's technical infrastructure, making it an exciting opportunity for engineers interested in system-level software development and large-scale computing.