ML Framework (MetalLM) Engineer

Apple is a technology company that designs, develops, and sells consumer electronics, computer software, and online services.
Waltham, MA, USA
Machine Learning
Senior Software Engineer
In-Person
3+ years of experience
AI
This job posting may no longer be active. You may be interested in these related jobs instead:
Senior Software Engineer - Apple ML Data Platform

Senior Software Engineer position at Apple working on ML Data Platform, building scalable distributed systems for machine learning infrastructure.

AIML - Software Engineer, Siri Cross Device, Siri and Information Intelligence

Senior Software Engineer position at Apple working on Siri's cross-device intelligence platform, requiring 5+ years of experience and expertise in AI/ML technologies.

AIML - Senior Machine Learning Engineer, Audio Generation, Siri and Information Intelligence

Senior ML Engineer role at Apple working on Siri's audio generation and AI systems, offering competitive pay and the chance to shape the future of conversational AI.

ML Engineer Multimodal Sensing - Incubation

Senior ML Engineer role at Apple focusing on multimodal sensing and AI technologies for next-generation products, offering competitive compensation and comprehensive benefits.

AIML - Sr. Machine Learning Engineer, NL Response Generation - Siri & Information Intelligence

Senior Machine Learning Engineer role at Apple focusing on NLP response generation for Siri, offering competitive compensation and the opportunity to impact millions of users.

Description For ML Framework (MetalLM) Engineer

Apple's ML Frameworks team in GPU, Graphics and Displays org is seeking extraordinary machine learning and GPU programming engineers who are passionate about providing robust compute solutions for accelerating Machine learning libraries on Apple Silicon. The team provides GPU acceleration for popular Machine learning libraries such as TensorFlow, PyTorch and JAX using Metal runtime and device backend. They optimize compute performance with kernels and computational graphs that are fine-tuned for the unique characteristics of each Metal GPU family.

Responsibilities include:

  • Designing and developing compiler-based optimizations for Metal backend in ML frameworks such as torch.compile for PyTorch
  • Working on cutting-edge ML inference framework projects and optimizing code for efficient and scalable ML inference using distributed techniques
  • Implementing features of Metal device backend for ML training acceleration technologies
  • Working with Core teams of PyTorch, JAX or Tensorflow to provide Metal runtime and device backend support
  • Tuning GPU-accelerated training across products
  • Performing in-depth analysis, compiler and kernel level optimizations to ensure the best possible performance across hardware families

The role offers the opportunity to influence the design of compute and programming models in next-generation GPU architectures. The team is always looking for exceptionally dedicated individuals to grow their outstanding team.

Required Qualifications:

  • 3+ years of programming and problem-solving experience with C/C++/ObjC
  • Experience with Distributed training or inference techniques
  • Knowledge of GPU compute programming models & optimization techniques
  • Experience with system-level programming and computer architecture

Preferred Qualifications:

  • Contributions to an AI framework such as PyTorch, JAX or Tensorflow
  • Experience with graph compilers such as Triton, OpenXLA or LLVM/MLIR
  • Good understanding of machine learning fundamentals

Apple is an equal opportunity employer committed to inclusion and diversity. They take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.

Last updated 2 months ago

Responsibilities For ML Framework (MetalLM) Engineer

  • Design and develop compiler-based optimizations for Metal backend in ML frameworks
  • Work on cutting-edge ML inference framework projects and optimize code for efficient and scalable ML inference
  • Implement features of Metal device backend for ML training acceleration technologies
  • Work with Core teams of PyTorch, JAX or Tensorflow to provide Metal runtime and device backend support
  • Tune GPU-accelerated training across products
  • Perform in-depth analysis, compiler and kernel level optimizations

Requirements For ML Framework (MetalLM) Engineer

Python
  • 3+ years of programming and problem-solving experience with C/C++/ObjC
  • Experience with Distributed training or inference techniques
  • Knowledge of GPU compute programming models & optimization techniques
  • Experience with system-level programming and computer architecture

Interested in this job?