Google's Site Reliability Engineering (SRE) team is seeking a Senior Software Engineer to join their dynamic organization that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. This role is crucial in ensuring Google Cloud's services maintain reliability and uptime while continuously improving performance.
As an SRE, you'll be at the forefront of managing complex challenges unique to Google Cloud's scale. Your responsibilities will span the entire service lifecycle, from design and development to deployment and maintenance. You'll work on optimizing existing systems, building infrastructure, and creating automation solutions to eliminate manual work.
The role requires a strong background in software development, distributed systems, and technical leadership. You'll be responsible for system design consulting, capacity planning, and maintaining service health through monitoring and optimization. The position offers the opportunity to work with cutting-edge technology while solving complex problems at massive scale.
Google's SRE culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. The team brings together people with varied backgrounds and perspectives, encouraging collaboration and innovation. You'll have the freedom to work on meaningful projects while receiving the support and mentorship needed for professional growth.
The Technical Infrastructure team, which you'll be part of, is fundamental to Google's operations, developing and maintaining data centers and building next-generation platforms. This role offers the unique opportunity to work with some of the world's most complex distributed systems while contributing to the reliability and performance of Google's global infrastructure.