Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Google is a global technology leader that specializes in internet-related services and products.
Site Reliability
Mid-Level Software Engineer
Hybrid
5,000+ Employees
2+ years of experience
Enterprise SaaS · Cloud

Description For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineering (SRE) at Google combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. As an SRE, you'll ensure Google Cloud's services maintain reliability and appropriate uptime while managing system capacity and performance. The role focuses on optimizing existing systems, building infrastructure, and automation. You'll tackle unique scaling challenges specific to Google Cloud, applying expertise in coding, algorithms, complexity analysis, and large-scale system design. Google's SRE culture emphasizes diversity, intellectual curiosity, and problem-solving in a blame-free environment. The team welcomes diverse backgrounds and perspectives, encouraging collaboration and innovation. You'll work on meaningful projects with support and mentorship for professional growth. The role involves managing technical priorities, deadlines, and deliverables, while designing, developing, testing, deploying, and enhancing software solutions. Join a team that values both technical excellence and inclusive collaboration, working on some of the most complex distributed systems challenges in the industry.

Last updated 2 days ago

Responsibilities For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

  • Write product or system development code
  • Review code developed by other engineers and provide feedback to ensure best practices
  • Contribute to existing documentation or educational content and adapt content based on product/program updates and user feedback
  • Triage product or system issues and debug/track/resolve by analyzing the sources of issues and the impact on hardware, network, or service operations and quality
  • Participate in, or lead design reviews with peers and stakeholders to decide amongst available technologies

Requirements For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Linux
Kubernetes
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages
  • Experience working in computing, distributed systems, storage, or networking
  • Expertise in designing, analyzing, and troubleshooting large-scale distributed systems
  • Ability to debug, optimize code, and automate routine tasks
  • Systematic problem-solving approach, coupled with effective verbal and written communication skills

Interested in this job?

Jobs Related To Google Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Software Developer II, Site Reliability Development, Google Cloud

Site Reliability Developer position at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and performance optimization.

Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineer position at Google focusing on building and maintaining large-scale distributed systems for Google Cloud services.

Databases Site Reliability Engineer

Site Reliability Engineer position at Google focusing on database systems, requiring expertise in programming, Linux systems, and distributed computing.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer role at Google Cloud focusing on building and maintaining large-scale distributed systems with emphasis on reliability and automation.

Software Engineer III, Site Reliability Engineering, Google Cloud

Site Reliability Engineer role at Google Cloud focusing on building and maintaining large-scale distributed systems with opportunities for both remote and in-office work in Warsaw, Poland.