Staff Site Reliability Engineer, ASE - Cloud Platforms

A leading technology company that designs, develops, and sells consumer electronics, software, and services.
$180,000 - $280,000
Site Reliability
Staff Software Engineer
In-Person
5,000+ Employees
8+ years of experience
Enterprise SaaS

Description For Staff Site Reliability Engineer, ASE - Cloud Platforms

Join Apple Service Engineering as a Staff Site Reliability Engineer where you'll play a crucial role in supporting and scaling cloud services for thousands of development and operations engineers. This position demands expertise in maintaining uncompromising scalability, high availability, and seamless performance. As an SRE, you'll establish practices for private/public cloud services, accelerating the delivery of thousands of applications reliably and consistently. You'll work hands-on with cloud platforms, container technologies, and modern observability tools while collaborating closely with developers and architects to design and implement solutions for improved stability, security, and scalability. The role combines traditional SRE responsibilities with innovative problem-solving, requiring expertise in Python, GoLang, Kubernetes, and various monitoring tools. You'll be responsible for creating documentation, automating processes, and ensuring system resilience through careful planning and testing. This is an opportunity to make a real difference in a company known for its commitment to excellence and innovation in technology.

Last updated 4 days ago

Responsibilities For Staff Site Reliability Engineer, ASE - Cloud Platforms

  • Support and scale cloud services for thousands of development and operations engineers
  • Establish SRE practices for private/public cloud service
  • Maintain system reliability, security, and scalability
  • Collaborate with developers and architects on system design
  • Operate and monitor production and non-production environments
  • Create alert handling procedures and runbooks
  • Automate service deployment and orchestration
  • Participate in capability planning and disaster recovery exercises

Requirements For Staff Site Reliability Engineer, ASE - Cloud Platforms

Python
Go
Kubernetes
  • Experience with major public cloud providers and cloud-native services
  • Proficiency in Kubernetes to deploy, operate and troubleshoot container based applications
  • Strong knowledge of SRE principles including monitoring, alerting, error budgets
  • Expertise in implementing telemetry using tools like Splunk, Grafana, and Prometheus
  • Proficiency in Python, GoLang for developing automation scripts and tools
  • Excellent interpersonal and communication skills
  • Technical (Engineering or Computer Science) BS/MS degree or equivalent work experience

Interested in this job?

Jobs Related To Apple Staff Site Reliability Engineer, ASE - Cloud Platforms

Sr. Engineering Program Manager, Security Site Reliability Engineering, Apple Services Engineering

Senior Engineering Program Manager role at Apple Services Engineering, focusing on security and reliability engineering for cloud infrastructure at scale.

Senior Engineering Program Manager, iCloud SRE, Apple Services Engineering

Senior Engineering Program Manager position at Apple Services Engineering, focusing on iCloud SRE team management and infrastructure development.

Messaging SRE Manager, Apple Services Engineering

Lead SRE position at Apple managing reliability for critical messaging services like iMessage and FaceTime, offering competitive pay and benefits.

Senior SRE Manager, iCloud

Lead iCloud's SRE teams at Apple, managing service reliability and performance while building high-performing engineering teams in Seattle.

Service Reliability Engineering (SRE) Manager, Analytics

Lead SRE Manager position at Apple Services Engineering, overseeing analytics infrastructure and engineering teams for global-scale data processing systems.