Google's Core ML team is seeking a Principal Engineer to lead GPU technology evaluation and optimization initiatives. This role combines deep technical expertise in GPU architecture with strategic leadership in machine learning infrastructure. The position involves optimizing ML models for state-of-the-art performance on GPUs, working closely with both cloud and internal customers, and creating robust infrastructure for ML systems access.
The role requires extensive experience (15+ years) in GPU performance optimization and inference work. The successful candidate will evaluate upcoming hardware technologies, advocate for GPU use-cases, and bridge the gap between TPU and GPU offerings. They will lead efforts to achieve industry-leading performance for key models through both traditional optimization techniques and automated infrastructure development.
As part of Google Cloud, which serves customers in over 200 countries, this position offers the opportunity to impact global digital transformation initiatives. The role comes with competitive compensation ($294,000-$414,000 base salary) plus additional benefits including bonus and equity packages.
The ideal candidate will combine technical depth in GPU architectures and ML optimization with strong leadership and communication skills. They will guide junior engineers, work directly with customers, and drive strategic technical decisions. This is an opportunity to shape the future of ML infrastructure at one of the world's leading technology companies while working with cutting-edge hardware and software technologies.