Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Google is a global technology leader that specializes in internet-related services and products.
Site Reliability
Mid-Level Software Engineer
In-Person
5+ years of experience
Enterprise SaaS

Description For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google Cloud's services—both our internally critical and our externally-visible systems—have reliability, uptime appropriate to customer's needs and a fast rate of improvement. Additionally SRE's will keep an ever-watchful eye on our systems capacity and performance. Much of our software development focuses on optimizing existing systems, building infrastructure and eliminating work through automation. On the SRE team, you'll have the opportunity to manage the complex challenges of scale which are unique to Google Cloud, while using your expertise in coding, algorithms, complexity analysis and large-scale system design. SRE's culture of diversity, intellectual curiosity, problem solving and openness is key to its success. Our organization brings together people with a wide variety of backgrounds, experiences and perspectives. We encourage them to collaborate, think big and take risks in a blame-free environment. We promote self-direction to work on meaningful projects, while we also strive to create an environment that provides the support and mentorship needed to learn and grow.

With your technical expertise, you will manage project priorities, deadlines, and deliverables. You will design, develop, test, deploy, maintain, and enhance software solutions.

Last updated 2 days ago

Responsibilities For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

  • Work on the availability, scalability, efficiency, and latency of some of Google Cloud's most critical services
  • Work with partner Development and SRE teams to design and deliver different programs and projects in a scalable, reliable, and secure manner (Cloud Capacity fungibility, Trusted Private Cloud, Convergence, Butter, etc.)
  • Design and develop innovative solutions that enable key Google initiatives that scale with the requirements of the business
  • Provide oncall and on-duty excellence; driving problems towards sustainable, long-term solutions
  • Contribute to continuous service improvement

Requirements For Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Linux
  • Bachelor's degree in Computer Science, a related field, or equivalent practical experience
  • 5 years of experience building and developing infrastructure, distributed systems or networks, or experience with compute technologies, storage, or hardware architecture
  • 2 years of experience with data structures/algorithms and software development in one or more programming languages
  • Experience with debugging and Linux

Interested in this job?

Jobs Related To Google Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Site Reliability Engineer II

Microsoft seeks a Site Reliability Engineer II for their Commerce and Ecosystems team to manage and automate large-scale platforms.

Software Developer II, Site Reliability Development, Google Cloud

Google Cloud seeks a Software Developer II for Site Reliability Development to build and maintain large-scale, fault-tolerant systems.

Software Developer II, Site Reliability Developing, Google Cloud

Google Cloud seeks a Software Developer II for Site Reliability Engineering to build and maintain large-scale, fault-tolerant systems.

Site Reliability Engineering, Transformative Compute Site Reliability Engineering

Google is seeking a Mid-Level Site Reliability Engineer to build and maintain large-scale distributed systems for Google Cloud services.