AWS Infrastructure Services is seeking an experienced Software Development Engineer to join the Pisa team, focused on improving AWS services' resiliency by preventing congestive collapse failures. This role combines academic theory, scientific experimentation, and pragmatic systems engineering to tackle complex distributed systems challenges.
The position involves working with a small, fast-moving team that designs and builds tooling to identify potential vulnerabilities before they impact services. You'll be responsible for implementing critical components of the Pisa tool suite, including admission control schemes, retry policies, and scaling infrastructure for modeling and simulation.
The ideal candidate brings experience with large-scale distributed systems and a passion for operational excellence. The role requires proficiency in Python and Java, along with the ability to write maintainable, well-tested code. You should be comfortable prototyping solutions quickly and solving problems creatively.
At AWS, you'll join the team responsible for keeping the cloud running, supporting all AWS data centers, servers, storage, networking, and infrastructure. You'll collaborate with diverse professionals across AWS to maintain the highest standards for safety and security while optimizing capacity and cost for customers.
AWS offers a strong culture of inclusion, with employee-led affinity groups, ongoing learning experiences, and a commitment to work-life harmony. You'll have opportunities for mentorship and career growth in an environment that values diverse experiences and perspectives.
This role suits someone who enjoys both the intellectual challenge of working with scientists and the practical impact of improving services used by customers worldwide. Join AWS to help shape the future of cloud infrastructure while working on some of technology's most challenging problems.