Amazon is seeking a Senior Software Development Engineer to join their Incident Command Systems team, focusing on Monitoring & Detection engineering for their worldwide retail websites. This role is crucial in reimagining incident management & response for Amazon's retail operations. The position involves designing and implementing strategic platforms for the central incident response team, working in a fast-paced environment where every minute matters during system incidents. The role requires deep technical expertise in developing scalable monitoring solutions that can handle thousands of services.
The ideal candidate will work within Amazon's complex architectural landscape, collaborating with service owners across the organization to integrate key performance indicators and develop elegant solutions. They will be part of a team responsible for improving remediation times for outages and building software components for monitoring and anomaly detection.
This is an opportunity to have direct impact on Amazon's operational resilience, working with executive decision-makers and central response teams. The role offers competitive compensation ranging from $151,300 to $261,500 based on location and experience, plus additional benefits including equity and sign-on payments. The position requires 5+ years of professional software development experience and strong leadership capabilities.
The Incident Command Systems team operates in small, efficient groups of 6-10 engineers, following Amazon's "two-pizza team" philosophy. Team members work on critical systems that protect and maintain Amazon's retail experience, requiring both technical excellence and business acumen. This role offers the chance to solve complex technical challenges at massive scale while contributing to the continuous improvement of Amazon's incident management capabilities.