Join Apple's Foundation Model Services team within the Machine Learning Platform Technologies organization, the backbone of Apple Intelligence. We build frameworks, services, and tools that power Apple's largest foundation models on servers. Our infrastructure supports crucial services including Apple Search, Apple Music, AppleTV, AppStore, iMessages, Photos & Camera, Spotlight, Safari, Siri, and upcoming products, serving millions of queries daily with incredibly low latencies.
As a Machine Learning Engineer, you'll work on optimizing billions of parameter language, vision, and speech models using state-of-the-art technologies at Apple's scale. You'll have the opportunity to impact billions of users worldwide, working with cutting-edge model architectures and high-throughput services at supercomputing scale.
The role involves close collaboration with product teams and the Foundation Model Research team, building production-grade solutions and developing inference capabilities for advanced model architectures. You'll be instrumental in building tools to analyze inference bottlenecks across different hardware configurations and use cases.
We're seeking someone who thinks differently, is eager to break the status quo, and isn't afraid to take risks. You'll be joining a team that values innovation and pushes the boundaries of computing and intelligence. This position offers the chance to mentor other engineers while working on technology that brings smiles to people's faces.
The ideal candidate brings strong expertise in ML technologies, including LLMs, NLP, and Information Retrieval, combined with practical experience in cloud infrastructure and modern programming languages. Your work will directly contribute to Apple's mission of bringing intelligent features to billions of users across their product ecosystem.