Software Engineering Manager II, Namespaces Site Reliability Engineering

Google is a global technology company that builds and runs large-scale, distributed systems and services.
$180,000 - $300,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS · AI

Description For Software Engineering Manager II, Namespaces Site Reliability Engineering

Google's Site Reliability Engineering (SRE) team is seeking a Software Engineering Manager II to lead its Namespaces SRE division. This role combines software and systems engineering to build and maintain Google's massive distributed systems. The position focuses on managing the durability, integrity, and efficiency of Google's planet-scale storage fleet, supporting cluster-level filesystems and global namespace services with data replication between clusters.

The role requires a strong technical background, including 8 years of experience with data structures or algorithms, along with 3 years of people management experience. You'll lead a team of Software/Systems Engineers, own service reliability, and build automation to prevent problems from recurring. The position involves managing on-call rotations across time zones and designing solutions to improve the availability and efficiency of Google's services.

As part of Google's Technical Infrastructure team, you'll work on critical systems that support Google's entire product portfolio, including Cloud and AI/ML initiatives. The role offers the opportunity to work with complex challenges at Google's scale while collaborating with diverse teams in an environment that encourages intellectual curiosity and innovation.

This is an excellent opportunity for experienced engineering leaders who are passionate about distributed systems, have strong problem-solving abilities, and want to make an impact on systems that serve billions of users. The role offers the chance to work with cutting-edge technology while leading and developing a team of talented engineers in a supportive and growth-oriented environment.

Last updated 3 days ago

Responsibilities For Software Engineering Manager II, Namespaces Site Reliability Engineering

  • Lead a team of Software/Systems Engineers delivering projects for users, and be accountable for service uptime
  • Own end-to-end availability and performance of key services and build automation to prevent problem recurrence
  • Manage on-call rotations across continents, using a follow-the-sun model
  • Design, write, and deliver software to improve the availability, scalability, latency, and efficiency of Google's services

Requirements For Software Engineering Manager II, Namespaces Site Reliability Engineering

  • Linux
  • Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 8 years of experience with data structures or algorithms
  • 5 years of experience with software development in one or more programming languages
  • 3 years of people management experience
  • Experience designing, analyzing, and troubleshooting distributed systems
