The AWS Neuron Compiler team is seeking skilled compiler engineers to develop a state-of-the-art deep learning compiler stack. This stack optimizes application models across diverse domains, including Large Language and Vision, from frameworks like PyTorch, TensorFlow, and JAX. You'll work with custom-built Machine Learning accelerators like Inferentia/Trainium, powering solutions such as Generative AI.
As a Sr. ML Compiler Engineer III on the Neuron Compiler Automated Reasoning Group, you will:
- Develop and maintain tooling for best-in-class technology to improve the Neuron Compiler's accuracy and reliability
- Lead efforts in building fuzzers and specification synthesis tooling for our LLVM-based compiler
- Work in a science-focused team, pushing the boundaries of knowledge to best serve our customers
- Utilize strong software development skills in C++/Python
Key qualifications:
- 3+ years of leading design or architecture of new and existing systems
- 2+ years of experience in developing compiler features and optimizations
- Proficiency in C++ and Python programming, applied to compiler or verification projects
- Familiarity with LLVM, including knowledge of abstract interpretation and polyhedral domains
- Demonstrated scientific approach to software engineering problems
Preferred qualifications:
- Masters degree or PhD in computer science or equivalent
- Experience with deep learning frameworks like TensorFlow or PyTorch
- Understanding of large language model (LLM) training processes
- Knowledge of CUDA programming for GPU acceleration
Location options: Seattle (Preferred), Cupertino, Austin, or Toronto
The base pay for this position ranges from $129,300/year in the lowest geographic market up to $223,600/year in the highest geographic market. Pay is based on factors including market location, job-related knowledge, skills, and experience. Amazon offers a comprehensive benefits package and total compensation approach.