Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. At Google, SRE ensures that services have the reliability and uptime their users need and that they improve quickly, while keeping a close watch on capacity and performance. SRE is also a mindset and a set of engineering approaches to running better production systems.

Much of SRE's software development work focuses on optimizing existing systems, building infrastructure, and eliminating manual work through automation. SREs are responsible for the big picture of how systems relate to each other, and they use a breadth of tools and approaches to solve a broad spectrum of problems. The role involves limiting the time spent on operational work, conducting blameless postmortems, and proactively identifying potential outages.

SRE culture values diversity, intellectual curiosity, problem-solving, and openness. The organization brings together people with diverse backgrounds, experiences, and perspectives, and it encourages collaboration, big thinking, and risk-taking in a blame-free environment. Google promotes self-direction so that engineers can work on meaningful projects, and it provides the support and mentorship they need to learn and grow. The role offers the opportunity to work on high-impact projects, contribute to large-scale systems, and join a team that values continuous improvement and innovation in engineering practices.