AWS Neuron is seeking a Senior Machine Learning Compiler Engineer to join their innovative team working on the SDK that optimizes ML models for AWS Inferentia and Trainium custom chips. This role offers an exciting opportunity to be at the forefront of AI revolution, working on next-generation compiler technology that transforms ML models from frameworks like PyTorch, TensorFlow, and JAX for deployment on AWS hardware.
The position involves solving complex compiler optimization problems to achieve optimal performance for various ML model families, including large language models, stable diffusion, and vision transformers. You'll work closely with chip architects, runtime engineers, and ML teams to deliver cutting-edge solutions that democratize access to AI infrastructure.
As a senior engineer, you'll lead the design and implementation of compiler optimizations, collaborate with open-source communities, and influence industry-wide partners. The role requires strong expertise in object-oriented programming (C++/Java) and compiler technology. Experience with LLVM/MLIR and ML frameworks is highly valued.
AWS offers a collaborative environment with emphasis on mentorship and career growth. The team maintains a startup-like atmosphere while providing the resources and stability of a global tech leader. You'll have opportunities to work on impactful projects that shape the future of machine learning infrastructure.
Benefits include comprehensive healthcare, work-life harmony, and extensive professional development opportunities. The position offers competitive compensation ranging from $151,300 to $261,500 based on location and experience, plus equity and additional benefits.
Join AWS to be part of a diverse, inclusive culture that values innovation and technical excellence. This role provides an unique opportunity to influence the future of AI infrastructure while working with cutting-edge technology at global scale.