Google Cloud is seeking a Software Engineer III for its Site Reliability Engineering (SRE) team. This role combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll keep Google Cloud's services reliable and available while continuously improving their performance and capacity.
The position offers unique challenges in managing complex systems at Google's scale. You'll apply your expertise in coding, algorithms, complexity analysis, and large-scale system design to optimize existing systems, build infrastructure, and automate processes. The role involves writing product code, reviewing others' code, contributing to documentation, troubleshooting issues, and participating in design reviews.
Google's SRE culture values diversity, intellectual curiosity, and problem-solving in a collaborative, blame-free environment. You'll tackle meaningful projects alongside teammates from varied backgrounds, with the support and mentorship needed to grow your skills.
Key responsibilities include developing system code, upholding best practices through code reviews, maintaining documentation, resolving complex issues, and contributing to technical decision-making. The ideal candidate has a strong background in Computer Science or a related field, with experience in software development and distributed systems.
Join Google Cloud's SRE team to push the boundaries of large-scale system reliability and performance, while working in an innovative and supportive environment that fosters both personal and professional growth.