Software Engineer III, Site Reliability Engineering, Google Cloud

Google is a global technology leader that specializes in internet-related services and products, including cloud computing, software, and hardware.
Site Reliability
Mid-Level Software Engineer
Contact Company
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineering (SRE) at Google Cloud combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing system capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automation. You'll tackle unique scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design. The team values diversity, intellectual curiosity, and problem-solving in a blame-free environment. You'll collaborate with professionals from diverse backgrounds, taking calculated risks and working on meaningful projects. The role offers strong support and mentorship for continuous learning and growth. Your technical expertise will be crucial in managing project priorities, deadlines, and deliverables, as well as designing, developing, testing, deploying, maintaining, and enhancing software solutions. Join a culture that promotes self-direction while providing the support needed to tackle complex distributed systems challenges.

Last updated 19 hours ago

Responsibilities For Software Engineer III, Site Reliability Engineering, Google Cloud

  • Write product or system development code
  • Review code developed by other engineers and provide feedback to ensure best practices
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies

Requirements For Software Engineer III, Site Reliability Engineering, Google Cloud

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with software development in one or more programming languages
  • 2 years of experience with data structures or algorithms
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and to automate routine tasks
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills

Interested in this job?

Jobs Related To Google Software Engineer III, Site Reliability Engineering, Google Cloud

Software Developer III, Site Reliability Development, Google Cloud

Site Reliability Developer role at Google Cloud focusing on building and maintaining large-scale distributed systems with competitive compensation and growth opportunities.

Technical Program Manager, Site Reliability Engineering

Technical Program Manager position at Google's SRE team, leading infrastructure and service delivery projects with focus on operational excellence and cross-functional collaboration.

Program Manager, Platforms and Devices Site Reliability Engineering

Lead complex technical programs for Google's Platforms and Devices SRE team, managing cross-functional projects and driving organizational efficiency.

Site Reliability Engineer

Site Reliability Engineer position at Google Dublin, focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Shopping Build Site Reliability Engineer

Site Reliability Engineer role at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.