AWS Neuron is Amazon's complete software stack for its cloud-scale machine learning accelerators, AWS Inferentia and AWS Trainium. This Senior Software Engineering role is part of the Machine Learning Inference Applications team, focusing on developing and optimizing core LLM inference components.
The position involves working with cutting-edge LLM technology, including attention mechanisms, MLP layers, quantization, speculative decoding, and mixture of experts. You'll collaborate with chip architects, compiler engineers, and runtime engineers to maximize performance on Neuron devices for models such as Llama 3.3 70B, Llama 3.1 405B, DBRX, and Mixtral.
The team culture emphasizes knowledge-sharing and mentorship, with senior members providing one-on-one mentoring and thorough code reviews. Career growth is supported through strategic project assignments that build engineering expertise. The role offers competitive compensation ranging from $151,300 to $261,500 depending on location, plus equity and comprehensive benefits.
This is an excellent opportunity for experienced engineers passionate about machine learning optimization who want to work on large-scale, impactful projects. The position requires strong programming skills, a solid understanding of ML fundamentals, and the ability to collaborate across teams. Amazon's inclusive culture and commitment to diversity make it a strong environment for innovation and professional growth.