AWS Hardware Engineering team is seeking a Systems Development Engineer to build the backbone of Generative AI cloud at AWS. This role focuses on designing, delivering, and operating AWS cloud offerings that enable high performance and scalability in AI/ML and HPC workloads. The position involves creating server designs that are industry-leading in frugality and operational excellence, critical to AWS's success and millions of customers.
The ideal candidate will be an innovative self-starter with comprehensive knowledge of the full technical stack - from baremetal server hardware to userland software. You'll work on delivering continuous price performance improvements for AI model training for multi-billion variable LLMs, while solving challenging technology problems and building architecturally sound components.
As part of the Hardware Engineering AI/ML development team, you'll collaborate with diverse teams across AWS, including SDEs, Hardware Engineers, and TPMs. Located in Seattle or Cupertino, you'll work on programs with global development teams and manage servers in datacenters worldwide. The role offers significant impact on AWS's bottom line and the opportunity to shape the future of cloud computing technology.
Key responsibilities include solving complex architectural problems, owning team systems, proactive issue identification, and leading the delivery of solutions. You'll use a combination of hardware, software, system designs, x86 architecture, and operations knowledge to drive high quality and reliability into AWS Accelerated server solutions.
The position offers competitive compensation ranging from $136,100 to $235,200 based on location, plus equity and comprehensive benefits. This is an excellent opportunity for experienced engineers passionate about cloud computing, AI/ML infrastructure, and building at scale.