Principal Engineer, Cloud ML Compute Systems

Google Cloud accelerates organizations' digital transformation, delivering enterprise-grade solutions leveraging cutting-edge technology and developer tools.
$278,000 - $399,000
Machine Learning
Principal Software Engineer
In-Person
5000+ Employees
15+ years of experience
AI · Enterprise SaaS

Description For Principal Engineer, Cloud ML Compute Systems

Google Cloud is seeking a Principal Engineer to join their team in a crucial role focused on machine learning optimization and performance enhancement. This position combines deep technical expertise with customer-facing responsibilities, making it an ideal opportunity for seasoned engineers passionate about ML systems.

The role involves leading critical initiatives to optimize machine learning models for Google Cloud's customers, working at both software and hardware levels. You'll be at the forefront of performance optimization, working with cutting-edge technologies including GPUs, TPUs, and various ML frameworks like PyTorch, JAX, and TensorFlow.

As a Principal Engineer, you'll be responsible for developing and implementing sophisticated optimization strategies, writing custom kernels, and creating bespoke solutions to meet specific customer requirements. The position requires a unique blend of technical depth in ML systems, performance optimization, and strong communication skills to effectively collaborate with customers and internal teams.

Google Cloud's position as a leader in cloud computing and ML infrastructure makes this an exceptional opportunity to impact the industry at scale. The role offers competitive compensation, including a substantial base salary range of $278,000-$399,000, plus bonus, equity, and comprehensive benefits.

The ideal candidate will bring 15+ years of software engineering experience, with deep expertise in ML frameworks, hardware accelerators, and performance optimization. You'll be joining a company known for its cutting-edge technology and commitment to innovation, working on solutions that help organizations across 200+ countries transform their businesses.

This role is perfect for someone who enjoys tackling complex technical challenges while having a direct impact on customer success. You'll be working in Sunnyvale, CA, at the heart of Silicon Valley, collaborating with some of the industry's best minds in machine learning and cloud computing.

If you're passionate about optimizing ML systems, have a track record of technical leadership, and want to work on technology that shapes the future of cloud computing and machine learning, this role offers an exceptional opportunity to advance your career while making a significant impact.

Last updated 2 days ago

Responsibilities For Principal Engineer, Cloud ML Compute Systems

  • Lead efforts to optimize machine learning models for speed, memory efficiency, and accuracy
  • Accelerate model training and inference by identifying and implementing software and hardware optimizations
  • Translate customer requirements into technical solutions
  • Identify bottlenecks and areas for improvement by developing and utilizing performance analysis tools

Requirements For Principal Engineer, Cloud ML Compute Systems

Python
  • Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience
  • 15 years of experience as a software engineer
  • Experience working with GPUs and other hardware accelerators
  • Experience working with ML Frameworks (PyTorch, JAX, or TensorFlow)
  • Experience in optimizing machine learning models
  • Experience writing custom kernels (CUDA, Pallas, etc.)
  • Experience with ML workload performance profiling and analysis
  • Excellent communication and people skills
  • Strong programming skills in Python or C/C++

Benefits For Principal Engineer, Cloud ML Compute Systems

Medical Insurance
Dental Insurance
Vision Insurance
  • bonus
  • equity
  • benefits

Interested in this job?

Jobs Related To Google Principal Engineer, Cloud ML Compute Systems

Managing Director, AI GTM Field Solutions Architect

Lead Google Cloud's AI GTM Field Solutions Architecture, managing global teams and driving AI/ML technology implementation for enterprise clients.

Customer Engineer, Applied AI, Google Cloud

Lead AI/ML solutions implementation as a Customer Engineer at Google Cloud, guiding enterprises in their digital transformation using cutting-edge AI technologies.

Customer Engineer, Applied AI, Google Cloud

Lead AI/ML solutions implementation as a Customer Engineer at Google Cloud, guiding enterprises in their digital transformation journey.

Senior Staff Research Scientist

Senior Staff Research Scientist position at Google Research, focusing on machine learning and AI development with competitive compensation and opportunity to shape future technologies.

Product Manager, TPU

Product Manager position at Google focusing on TPU Machine Learning infrastructure, requiring 10+ years of experience in product management and expertise in data centers and hardware.