Google's Site Reliability Engineering (SRE) team is at the forefront of maintaining and optimizing the large-scale distributed systems that power Google Cloud's services. As a Software Engineer on the Traffic Trust SRE team, focusing on DoS Infrastructure, you'll be responsible for ensuring the reliability, security, and performance of critical systems. The role combines software and systems engineering to build and maintain fault-tolerant systems at massive scale.
You'll optimize existing systems, build infrastructure, and automate processes to eliminate manual work. The position presents challenges of scale unique to Google Cloud, where you'll apply your expertise in coding, algorithms, complexity analysis, and large-scale system design. The team particularly values diversity, intellectual curiosity, and problem-solving in a blame-free environment.
The role involves hands-on work with security-relevant services, monitoring systems, and distributed configuration delivery systems. You'll also serve as a consultant to other teams on DoS prevention, security, capacity planning, and reliability. As part of an on-call rotation, you'll respond to significant incidents and help maintain the robust infrastructure that powers Google's services.
Google offers a collaborative environment where you'll work with professionals from diverse backgrounds and perspectives. The company promotes self-direction while providing support and mentorship for continuous learning and growth. This position is perfect for someone who enjoys tackling complex technical challenges, has a strong foundation in systems engineering, and wants to work at the intersection of reliability and security at a global scale.