Site Reliability Engineer, ESC Managed Operations

World's most comprehensive and broadly adopted cloud platform, pioneering cloud computing and continuous innovation.
Site Reliability
Senior Software Engineer
In-Person
5,000+ Employees
3+ years of experience
Enterprise SaaS · Cloud

Description For Site Reliability Engineer, ESC Managed Operations

AWS is launching its first European Sovereign Cloud (ESC), a groundbreaking initiative in utility computing. As a Site Reliability Engineer on the AWS Managed Operations team, you'll play a crucial role in building and leading operations for high-availability AWS services, including EC2, S3, DynamoDB, Lambda, and Bedrock, specifically for EU customers.

The role combines day-to-day operations management with long-term software engineering to reduce operational toil. You'll work at the intersection of technology leadership and hands-on engineering, focusing on improving service availability, reliability, latency, performance, and efficiency. The position involves collaboration with global AWS teams and direct influence on the evolution of AWS services.

AWS, as the world's leading cloud platform, offers an environment of continuous innovation and learning. The role is based in Dublin, Ireland, and includes on-call responsibilities. You'll be part of the Utility Computing (UC) organization, supporting a wide range of services from foundational offerings like S3 and EC2 to cutting-edge innovations.

The ideal candidate brings 3+ years of software development experience, strong Linux and networking fundamentals, and proficiency in languages like Java, TypeScript, Python, or Ruby. You'll work in an inclusive environment that values diverse experiences and work-life harmony, with access to extensive career development resources and mentorship opportunities.

This is an exceptional opportunity to shape the future of cloud computing in Europe while working with cutting-edge technology at scale. You'll be part of a team that prioritizes innovation, customer satisfaction, and technical excellence, while enjoying the benefits of working for a global technology leader committed to becoming Earth's Best Employer.

Last updated 20 days ago

Responsibilities For Site Reliability Engineer, ESC Managed Operations

  • Oversee the launch of the European Sovereign Cloud (ESC) in 2025
  • Work with global AWS teams to influence AWS services and technology evolution
  • Collaborate with technology leaders to enhance day-to-day operations
  • Ensure improvements in availability, reliability, latency, performance, and efficiency
  • Participate in on-call rotations for incident resolution
  • Build and lead operations and development teams for high-availability AWS services
  • Support development and management of Compute, Database, Storage, IoT, Platform, and Productivity Apps services

Requirements For Site Reliability Engineer, ESC Managed Operations

Python · Java · TypeScript · Linux · Ruby
  • 3+ years of experience in software development with proficiency in Java, TypeScript, Python, or Ruby
  • 3+ years of experience with Linux, command line, and computer networking fundamentals
  • Ability to troubleshoot at all levels: network, operating system, and software application
  • Experience supporting cloud systems
  • Fluency in written and spoken English
  • Legal right to work in Ireland

Benefits For Site Reliability Engineer, ESC Managed Operations

Relocation Benefits
  • Relocation support within the European Union
  • Work-life harmony focus
  • Mentorship and career growth opportunities
  • Employee-led affinity groups
  • Inclusive team culture
  • Continuous learning and development
