Senior Service Reliability Engineer - Apple Data Platform

Apple Services Engineering team powers the App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books, delivering entertainment in over 35 languages to more than 150 countries.
Site Reliability
Staff Software Engineer
In-Person
5+ years of experience
Enterprise SaaS

Description For Senior Service Reliability Engineer - Apple Data Platform

Apple Services Engineering (ASE) is seeking a Senior Service Reliability Engineer to join their dynamic team that powers major Apple services including App Store, Apple TV, Apple Music, Apple Podcasts, and Apple Books. This role is crucial in maintaining the infrastructure that serves entertainment to millions of users across 150+ countries in 35+ languages.

As an SRE, you'll be at the forefront of ensuring mission-critical cloud systems maintain constant uptime and scale seamlessly. You'll work with cutting-edge technologies including AWS, GCP, and Kubernetes, while collaborating with developers and architects to improve stability, security, and scalability of Apple's service infrastructure.

The ideal candidate will be self-motivated with a strong attention to detail and excellence. You'll be responsible for both operational support and strategic implementation of infrastructure improvements. Your role will involve automating deployments, orchestrating services, and ensuring robust system reliability across multiple cloud environments.

This position offers a unique opportunity to work at the intersection of technology and entertainment, contributing to services that impact millions of users daily. You'll be part of a small, multi-functional team that maintains Apple's commitment to privacy while pushing the boundaries of what's possible in cloud services.

The role combines hands-on technical work with strategic planning, requiring expertise in modern DevOps practices, cloud platforms, and programming languages like Golang and Python. You'll have the chance to work on large-scale systems while maintaining the agility and impact of a smaller team environment.

Join Apple's Services Engineering team to help shape the future of digital entertainment while working with some of the most advanced cloud infrastructure in the industry. This role offers the perfect blend of technical challenge and real-world impact, making it an ideal opportunity for an experienced SRE looking to make a difference at global scale.

Last updated 22 days ago

Responsibilities For Senior Service Reliability Engineer - Apple Data Platform

  • Operate, monitor, and triage production and non-production environments
  • Automate deployment and orchestration of services in cloud environment
  • Work on multiple cloud environments like AWS and GCP
  • Participate in capacity planning, scale testing, and disaster recovery exercises
  • Support partner teams including engineering, QA, and program management
  • Maintain relationships with internal and external third-party vendors
  • Design and implement RESTful/RPC API and services using Golang or Python

Requirements For Senior Service Reliability Engineer - Apple Data Platform

Go
Python
Kubernetes
  • Bachelor's Degree in Computer Science, engineering-related field, or equivalent experience
  • 5+ years in Service Reliability Engineering, DevOps, or Infrastructure focused role
  • Expert-level experience with Kubernetes and AWS
  • Experience with multiple cloud environments (AWS and GCP)
  • Proficiency in Golang and Python
  • Experience designing and implementing RESTful/RPC API and services

Interested in this job?

Jobs Related To Apple Senior Service Reliability Engineer - Apple Data Platform

Site Reliability Engineer (SRE) - Object Storage

Senior SRE position at Apple focusing on distributed storage systems, offering competitive compensation and the opportunity to impact millions of users.

Senior Compute SRE (GPU) - Apple Services Engineering

Senior Compute SRE (GPU) role at Apple Services Engineering, focusing on GPU-accelerated infrastructure and Kubernetes clusters.

Staff Software Engineer, Reliability Engineering

Staff Software Engineer position at Airbnb focusing on Site Reliability Engineering, developing and maintaining tools for service reliability at scale.

Sr Staff Software Engineer, Reliability Engineering

Senior Staff SRE position at Airbnb focusing on reliability architecture, incident management, and technical leadership, offering competitive compensation and remote work flexibility.

Technical Program Manager III, Site Reliability, Storage

Technical Program Manager III position at Google, leading Storage Site Reliability Engineering initiatives and cross-functional programs.