Staff Site Reliability Engineer, Kubernetes ASE - Golang

A leading technology company that designs, develops, and sells consumer electronics, software, and services.
Site Reliability
Staff Software Engineer
In-Person
8+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer, Kubernetes ASE - Golang

Join the Apple Services Engineering (ASE) team as a Staff Site Reliability Engineer and be part of something extraordinary. This role focuses on supporting and scaling cloud services that impact thousands of development and operations engineers. As an SRE, you'll be instrumental in establishing and maintaining SRE practices for private/public cloud services, ensuring uncompromising scalability, high availability, and seamless performance.

The position requires deep expertise in Kubernetes, cloud platforms, and Go, combining technical prowess with strong collaborative abilities. You'll work closely with developers and architects to design and implement solutions that enhance stability, security, and scalability. The role goes beyond traditional SRE work, offering opportunities to shape the future of Apple's infrastructure.

Your responsibilities will span from operating production environments to implementing innovative solutions for complex challenges. You'll be involved in everything from telemetry implementation to disaster recovery planning, making a real difference in how Apple delivers thousands of applications reliably and consistently.

The ideal candidate brings not just technical expertise but also strong communication skills and a collaborative mindset. You'll be part of a team that values innovation, quality, and attention to detail. This is an opportunity to work with cutting-edge technology while contributing to systems that directly impact Apple's global operations and customer experiences.

At Apple, your ideas have the power to shape the future of our products, services, and customer experiences. The company offers a collaborative environment where your expertise will be valued and your contributions will make a tangible impact on how thousands of applications are delivered and maintained.

Last updated a day ago

Responsibilities For Staff Site Reliability Engineer, Kubernetes ASE - Golang

  • Support and scale cloud services for thousands of development and operations engineers
  • Establish SRE practices for private/public cloud services
  • Maintain system uptime and scalability
  • Collaborate with developers and architects
  • Design and implement solutions for improved stability, security, and scalability
  • Operate and monitor production and non-production environments, and prioritize tasks across them
  • Create alert handling procedures and runbooks
  • Automate service deployment and orchestration
  • Participate in capacity planning, scale testing, and disaster recovery exercises

Requirements For Staff Site Reliability Engineer, Kubernetes ASE - Golang

Go
Kubernetes
  • BS or MS in Computer Science or equivalent proven experience
  • Deep understanding of Kubernetes architecture, components, and best practices
  • Experience with major public cloud providers and cloud-native services
  • Proficiency in Go for developing automation scripts, tools, and custom applications
  • Expertise in implementing telemetry using tools like Splunk, Grafana, and Prometheus
  • Strong focus on reliability, availability, and performance
  • Excellent interpersonal and communication skills
