Site Reliability Development at Google Cloud combines software and systems development to build and run large-scale, massively distributed, fault-tolerant systems. As a Site Reliability Developer II, you'll be responsible for ensuring Google Cloud's services maintain reliability and appropriate uptime while driving continuous improvement. The role involves managing complex challenges of scale unique to Google Cloud, utilizing expertise in coding, algorithms, complexity analysis, and large-scale system design.
The team culture emphasizes diversity, intellectual curiosity, problem-solving, and openness. Google brings together people with diverse backgrounds and perspectives, encouraging collaboration and risk-taking in a blame-free environment. The organization promotes self-direction on meaningful projects while providing support and mentorship for learning and growth.
You'll be working on optimizing existing systems, building infrastructure, and automating processes. Your responsibilities include managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, maintaining, and enhancing software solutions. The role requires strong technical expertise and the ability to work effectively with cross-functional teams.
Google offers a supportive work environment with opportunities for professional development and impact at scale. The company is committed to building a representative workforce and creating a culture of belonging, providing equal employment opportunities regardless of background. This role combines technical challenges with the opportunity to shape the reliability and performance of Google Cloud's critical infrastructure.