Software Engineer, LLM Pre-Training Optimization

AI research company working on advancing artificial intelligence for widespread public benefit and scientific discovery
$235,000 - $350,000
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
8+ years of experience
AI

Description For Software Engineer, LLM Pre-Training Optimization

Google DeepMind, a leading force in artificial intelligence research, is seeking a Senior Software Engineer to join its Pre-Training team. This role is an opportunity to shape the future of large language models (LLMs) by optimizing their training and inference at an unprecedented scale.

The position involves working at the intersection of cutting-edge ML model development and hardware optimization, specifically focusing on TPU architecture. You'll be responsible for co-designing models and implementing crucial components across various layers, from model architecture to custom kernels, ensuring maximum efficiency in frontier model delivery.

As part of this role, you'll work with state-of-the-art LLMs, optimizing their performance on Google's advanced hardware accelerators throughout the entire lifecycle, from research to deployment. Your expertise will be crucial in developing custom kernels, collaborating with compiler and framework teams, and ensuring efficient training at industry-leading scales.

The ideal candidate brings deep expertise in distributed training of LLMs, particularly at the 1e25 FLOPs scale on modern GPU/TPU clusters. Your experience with ML frameworks like JAX and PyTorch, combined with low-level programming expertise in CUDA and OpenCL, will be essential for success in this role.

At Google DeepMind, you'll be part of a diverse team of scientists, engineers, and ML experts working together to advance artificial intelligence for widespread public benefit. The company offers competitive compensation, including a base salary range of $235,000 - $350,000, plus bonus, equity, and comprehensive benefits.

This is an opportunity to work on some of the most challenging and impactful problems in AI while contributing to the development of a technology that could be one of humanity's most useful inventions. Join a team that prioritizes safety and ethics while pushing the boundaries of what's possible in artificial intelligence.

Last updated 10 hours ago

Responsibilities For Software Engineer, LLM Pre-Training Optimization

  • Optimize the pre-training efficiency and performance of LLMs on Google's hardware accelerators
  • Guide model design for inference efficiency
  • Improve LLM performance through optimization and custom kernel development
  • Collaborate with compiler, framework, and platform teams
  • Profile models to identify performance bottlenecks
  • Develop low-level custom kernels for critical operators
  • Work with research teams on enabling new critical operators

Requirements For Software Engineer, LLM Pre-Training Optimization

  • Proven track record of contributions to distributed training of LLMs at 1e25 FLOPs scale on GPU/TPU clusters
  • Experience in programming hardware accelerators using ML frameworks (JAX, PyTorch) and low-level programming models (CUDA, OpenCL)
  • Experience with custom kernels and compiler infrastructure
  • Experience with Python and neural network training

Benefits For Software Engineer, LLM Pre-Training Optimization

  • Bonus
  • Equity
  • Benefits package

