The Pisa team at AWS is dedicated to enhancing the resilience of AWS services by preventing congestive collapse failures. As a Senior Software Engineer, you'll join a small, dynamic team that designs and builds innovative tooling to identify and prevent service vulnerabilities. The role combines academic theory, scientific experimentation, and practical systems engineering to solve complex challenges.
You'll work on critical infrastructure that helps protect AWS services used by customers worldwide. The position requires someone who can balance writing production-quality code with rapid prototyping when needed. You'll implement admission control schemes and retry policies, and scale the team's simulation infrastructure.
The ideal candidate is passionate about operational excellence and has experience with large-scale distributed systems. You'll need to be comfortable diving into unfamiliar code and documentation, with a particular interest in system behavior, failure modes, and recovery mechanisms. The role offers a unique blend of scientific collaboration and practical engineering impact.
AWS offers an inclusive culture with employee-led affinity groups, mentorship opportunities, and strong support for work-life harmony. The company values diverse experiences and backgrounds, encouraging applications from candidates with non-traditional career paths. You'll be part of a team that's continuously innovating and directly impacting services used by customers globally.
This role provides an opportunity to work on challenging technical problems while collaborating with scientists and engineers across AWS. You'll be at the forefront of improving cloud infrastructure reliability, with a direct impact on the resilience of AWS services.