Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE in the Ads Quality Infrastructure team, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while monitoring system capacity and performance. The role involves optimizing existing systems, building infrastructure, and automating processes.
You'll tackle unique scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design. The SRE team values intellectual curiosity, problem-solving, and openness, bringing together diverse backgrounds and perspectives. Google encourages collaboration, big-picture thinking, and risk-taking in a blame-free environment.
The position offers opportunities for self-direction on meaningful projects while providing support and mentorship for professional growth. You'll manage project priorities, deadlines, and deliverables, while designing, developing, testing, deploying, maintaining, and enhancing software solutions. The role combines technical expertise with system reliability to ensure Google's advertising infrastructure operates at peak efficiency.
Working at Google means joining a company committed to diversity, equality, and creating a culture of belonging. You'll be part of a global team that values English proficiency for effective collaboration while maintaining high standards of technical excellence and innovation in distributed systems management.