AWS Neuron is seeking a Senior Machine Learning Engineer to join its ML Applications team, which owns the complete software stack for the AWS Inferentia and Trainium cloud-scale machine learning accelerators. The role offers the opportunity to work at the forefront of machine learning infrastructure, specifically with massive-scale language models such as Llama 2, GPT-2, and GPT-3.
The position combines deep machine learning expertise with high-impact software development. You'll develop and optimize distributed inference solutions, working directly with compiler and runtime engineers. The role requires expertise in tuning large models for both latency and throughput using Python, PyTorch, or JAX, with a particular focus on DeepSpeed and other distributed inference libraries.
As part of Amazon's AWS team, you'll work in a startup-like environment while having the resources and impact of a global tech leader. The team culture emphasizes knowledge-sharing and mentorship, with senior members providing one-on-one mentoring and thorough code reviews. You'll collaborate with internal and external stakeholders, participate in critical design discussions, and directly influence business decisions through technical expertise.
Compensation is highly competitive, with a base salary ranging from $129,300 to $223,600 depending on location, plus additional benefits including equity and sign-on payments. The role offers the unique opportunity to work on cutting-edge ML infrastructure powering some of the most advanced AI models in production today, making it an ideal position for someone looking to make a significant impact in machine learning systems.