Google Cloud's Site Reliability Engineering (SRE) team combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing complex challenges of scale. The role involves optimizing existing systems, building infrastructure, and automating processes.
The Technical Infrastructure team is responsible for the architecture that powers Google's product portfolio, from developing and maintaining data centers to building next-generation Google platforms. The team takes pride in being the engineers' engineers, focusing on keeping networks running optimally for the best user experience.
You'll work in a culture that values diversity, intellectual curiosity, and problem-solving in a blame-free environment. The organization brings together people with diverse backgrounds and perspectives, encouraging collaboration and big-picture thinking. You'll have the opportunity to work on meaningful projects with the support and mentorship needed for professional growth.
The role combines technical expertise in distributed systems, software development, and infrastructure management with a focus on reliability and scalability. You'll be involved in the entire service lifecycle, from design to deployment and optimization, while working with cutting-edge technology at a global scale. The position offers the chance to impact millions of users while working with some of the most complex and innovative technology infrastructure in the industry.