Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing system capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automating processes.
You'll tackle scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design. The team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll collaborate with professionals from various backgrounds, take calculated risks, and work on meaningful projects.
The position offers opportunities for growth through supportive mentorship while promoting self-direction. You'll manage project priorities, deadlines, and deliverables, and you'll design, develop, test, deploy, maintain, and enhance software solutions. The role pairs software engineering skills with responsibility for system reliability, so it suits engineers who are passionate about large-scale infrastructure and service reliability.
Google provides a hybrid workplace environment, offering flexibility between remote and in-office work. The company is committed to building an inclusive culture and provides equal employment opportunities to all candidates. This role requires English proficiency to facilitate efficient global collaboration.