Apple's applied research and engineering team is seeking a Machine Learning Model Optimization Engineer to join their innovative group responsible for developing real-time on-device Language, Computer Vision, and Machine Perception technologies. The role focuses on optimizing and deploying Apple Intelligence LLM and diffusion models across Apple products.
As a key member of the team, you'll be working at the forefront of technological advancement, implementing cutting-edge optimization techniques for large language and diffusion models on devices. The position offers the unique opportunity to influence Apple's sensor and silicon roadmap while collaborating across hardware, software, and ML teams.
The ideal candidate brings 10+ years of industry experience, strong Python skills, and extensive knowledge in model compression algorithms, including quantization, pruning, and distillations. You'll be expected to lead large-scale projects, demonstrate excellent communication skills, and have a passion for shipping machine learning models on device.
This role provides an exceptional opportunity to impact Apple's entire ML model lifecycle while working with state-of-the-art technologies. The position offers comprehensive benefits, including competitive base pay, stock options, medical coverage, and educational support. Join Apple in pushing the boundaries of on-device AI technology and helping shape the future of intelligent computing experiences.