Staff Software Engineer, Scaling On-Device ML Frameworks

Google Cloud accelerates organizations' digital transformation with cutting-edge technology and developer tools.
Machine Learning
Staff Software Engineer
In-Person
8+ years of experience
AI

Description For Staff Software Engineer, Scaling On-Device ML Frameworks

Google Cloud is seeking a Staff Software Engineer to work on scaling on-device ML frameworks. The role involves developing next-generation technologies that change how billions of users connect, explore, and interact with information. The focus is on On-Device Machine Learning (ML), especially with the rise of Generative AI, which is central to Google's product portfolio. The team is building core capabilities powering unique experiences on Pixel and Android, such as Speech Recognition, Camera, and Assistant/Translation.

Key responsibilities include:

  1. Integrating frameworks like Hugging Face/JAX/PyTorch with AI Edge Torch and MP LLM Inference API.
  2. Developing sample apps demonstrating best practices in hardware acceleration, quantization, debugging, and optimizing performance and quality.
  3. Onboarding state-of-the-art models to provide excellent self-serve out-of-the-box experiences for users.
  4. Scaling partnerships with Android Apps and Google Cloud Customers and driving improvements in the overall stack based on real customer needs.
  5. Implementing automated processes and infrastructure to test APIs and user-journeys across diverse platforms.

The ideal candidate should have experience with frameworks such as JAX and PyTorch, full-stack Android, iOS, and Web development, OSS development, on-device machine learning deployment, Machine Learning infrastructure, Mobile App Development, and Integration Testing. Experience with iOS Development, Android Development, C++, and Python is preferred.

Google Cloud offers enterprise-grade solutions leveraging cutting-edge technology and tools for developers to build more sustainably. The company is committed to diversity, equality, and inclusion, providing equal employment opportunities to all candidates.

Last updated a month ago

Responsibilities For Staff Software Engineer, Scaling On-Device ML Frameworks

  • Ramp to our frameworks in the form of E2E integrations between popular Hugging Face/JAX/PyTorch libraries and AI Edge Torch, MP LLM Inference API, etc.
  • Sample apps demonstrating best practices including hardware accelerations, quantization, debugging and optimizing performance and quality
  • Onboard SOTA models to provide excellent self-serve out-of-the-box experience for users
  • Scale partnerships with Android Apps and Google Cloud Customers and drive improvements in the overall stack based on real customer needs
  • Define and implement automated processes and infrastructure to test APIs and user-journeys across diverse platforms and advocate for product excellence

Requirements For Staff Software Engineer, Scaling On-Device ML Frameworks

Python
Java
  • Bachelor's degree or equivalent practical experience
  • 8 years of experience in mobile application development or Android framework development
  • 5 years of coding experience in one or more of the following languages: C, C++, Java, or Python
  • Experience with frameworks such as JAX, and PyTorch
  • Full-stack Android, iOS, and Web development experience
  • OSS development experience
  • Experience in on-device machine learning deployment in any domain - vision, speech, text etc.
  • Experience with Machine Learning infrastructure, Mobile App Development, and Integration Testing
  • Experience with iOS Development, Android Development, C++, and Python

Interested in this job?

Jobs Related To Google Staff Software Engineer, Scaling On-Device ML Frameworks

Senior Research Scientist, Interactive Recommender Systems

Senior Research Scientist position at Google Research focusing on interactive recommender systems, machine learning, and AI, offering competitive compensation and benefits.

Staff Research Scientist, Google Cloud AI

Lead AI research scientist position at Google Cloud, focusing on advancing AI technology and its applications across industries while contributing to the research community.

Staff Software Developer, Generative AI, Gemini Code Assist

Lead the development of AI-powered developer tools at Google's Gemini Code Assist team, focusing on machine learning and generative AI applications.

Product Manager, AI/ML, Google Cloud

Lead AI/ML product management at Google Cloud, developing strategic vision for ML hardware stack and collaborating with teams like DeepMind and YouTube.

Senior Research Scientist, Multilingual NLP

Senior Research Scientist position at Google focusing on multilingual NLP and LLMs, requiring PhD and 7+ years of experience in machine learning and natural language processing.