Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while focusing on continuous improvement. The role involves managing complex challenges unique to Google Cloud's scale, utilizing expertise in coding, algorithms, complexity analysis, and large-scale system design.
The position emphasizes software development for system optimization, infrastructure building, and work automation. You'll be part of a diverse team that values intellectual curiosity, problem-solving, and openness. Google's SRE culture brings together individuals with varied backgrounds and perspectives, encouraging collaboration and risk-taking in a blame-free environment.
Working in Zürich, you'll collaborate with development teams in California and Bangalore, contributing to critical Google Cloud services. The role offers opportunities for self-direction on meaningful projects while providing support and mentorship for professional growth. You'll be responsible for maintaining system capacity, performance, and developing automation to prevent issues.
This position is ideal for someone who combines technical expertise with strong problem-solving abilities and communication skills. You'll work at the intersection of software development and systems engineering, making a direct impact on Google Cloud's infrastructure while being part of a supportive, learning-oriented team culture.