AWS Hardware Engineering team is seeking a Systems Development Engineer to build the backbone of Generative AI cloud infrastructure. This role focuses on developing and implementing next-generation AWS platforms for AI training and inference, particularly for multi-billion variable LLMs. The position combines hardware engineering with cloud computing expertise, requiring deep understanding of server systems from bare metal to user-level software.
The role involves working with AWS Hardware Engineering's AI/ML development team, collaborating across global development teams in Seattle, Cupertino, and Austin. You'll be responsible for launching hardware in the fleet and managing servers located in datacenters globally. The position demands expertise in both hardware and software integration, system architecture, and operational excellence.
As a technical leader, you'll solve complex architectural problems, own team systems, and work proactively to identify and address deficiencies before they impact customers. The role requires strong debugging skills, leadership capabilities, and the ability to work effectively with various teams including SDEs, Hardware Engineers, and TPMs.
AWS values diverse experiences and work-life harmony, offering comprehensive benefits including medical coverage, equity compensation, and career development opportunities. The company maintains an inclusive culture through employee-led affinity groups and ongoing learning experiences. This position offers the opportunity to shape the future of cloud computing technology while working with cutting-edge AI/ML infrastructure.
The ideal candidate will have strong programming skills in modern languages, experience with Linux/Unix environments, and a proven track record in systems development and architecture. You'll be part of AWS Infrastructure Services (AIS), which owns the design, planning, delivery, and operation of all AWS global infrastructure, working on challenging problems that directly impact millions of AWS customers.