At Infrastructure Reliability Engineering within Amazon, we are seeking talented Software Development Engineers to join our team focused on building scalable solutions that ensure the reliability of Amazon's critical systems. Our team develops and operates tools for distributed tracing, network analysis, and event correlation at Amazon scale. We work on detecting and preventing outages to maintain high availability across global infrastructure, directly impacting millions of customers.
The role involves working with core technologies including Java, Python, Linux, and AWS services to build intelligent and real-time insights into service-to-service communications and network traffic. You'll be part of developing solutions that support visibility into anomalous service behavior and ensure high availability for Amazon's fulfillment and robotics services.
We offer a collaborative environment where you'll work alongside talented engineers, Product Managers, Technical Program Managers, and Senior Leadership. The team fosters a culture of continuous learning and professional growth, encouraging innovation and problem-solving at a global scale. You'll have the opportunity to work on greenfield programs while making meaningful contributions to critical systems.
Success in this role requires passion for creating maintainable, high-quality software with robust automated testing, deployed through continuous delivery. Your work will directly impact Amazon's ability to deliver customer orders on time, making this an opportunity to contribute to essential infrastructure at one of the world's largest technology companies.
The position offers comprehensive benefits including medical, dental, and vision coverage, parental leave options, PTO, and a 401(k) plan. Join us to be part of a team that's redefining industry standards while solving complex challenges at unprecedented scale.