Research Engineer - Gemini Pretraining

Google DeepMind

AI research company working on advancing artificial intelligence for widespread public benefit and scientific discovery

New York, NY, USA

$136,000 - $300,000

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Description For Research Engineer - Gemini Pretraining

Google DeepMind is at the forefront of artificial intelligence research, working to advance AI for widespread public benefit and scientific discovery. We're seeking a Senior Research Engineer to join our team working on the cutting-edge Gemini project.

This role focuses on the critical task of inference-optimized model design for Gemini pretraining, requiring expertise in both theoretical machine learning and practical implementation. You'll be working with state-of-the-art Large Language Models (LLMs), focusing on optimizing architecture selection for fast inference while maintaining predictable quality.

The position demands a deep understanding of XLA primitives and practical experience with JAX on TPUs. You'll be involved in implementing various distillation techniques and curating datasets for model training. The role spans the entire LLM preparation stack, from pretraining to serving, requiring both technical depth and breadth.

We offer a competitive compensation package ranging from $136,000 to $300,000, plus bonus and equity. Our benefits include comprehensive medical and dental insurance, enhanced parental leave, and flexible working options. The role is based in New York City, with relocation support available.

The ideal candidate will thrive in an ambiguous environment, demonstrate flexibility in approach, and have a proven track record in machine learning and large-scale model development. You'll be joining a diverse team of scientists, engineers, and ML experts working together to push the boundaries of AI technology.

This is an exceptional opportunity to work on groundbreaking AI technology that could be one of humanity's most useful inventions. You'll be part of a team that prioritizes safety and ethics while collaborating on critical challenges in the field of artificial intelligence.

Last updated 3 months ago

Responsibilities For Research Engineer - Gemini Pretraining

Focus on inference-optimized model design for Gemini pretraining
Work on predictable LLM quality with optimized architecture selection
Implement various distillation techniques
Work across the LLM preparation stack (pretraining, finetuning, serving)
Dataset curation for models
Collaborate with researchers and product teams

Requirements For Research Engineer - Gemini Pretraining

Python

TypeScript

BSc, MSc or PhD/DPhil degree in computer science, mathematics, applied stats, machine learning or similar experience
Proven knowledge and experience of Python or C++
Knowledge of machine learning and statistics
Proven experience working with Large Language Models (LLMs)
Knowledge of algorithm design
Experience with Tensorflow or similar ML frameworks (e.g. JAX)
Experience fine-tuning large models
Software Engineering experience
Great communication skills and proven interpersonal skills

Benefits For Research Engineer - Gemini Pretraining

Medical Insurance

Dental Insurance

Parental Leave

Relocation Benefits

Visa Sponsorship

Enhanced maternity, paternity, adoption, and shared parental leave
Private medical and dental insurance for employee and dependents
Flexible working options
On-site gym
Healthy food
Faith rooms
Terraces
Immigration support
Relocation support

Google DeepMind

AI research company working on advancing artificial intelligence for widespread public benefit and scientific discovery

New York, NY, USA

$136,000 - $300,000

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Interested in this job?

Jobs Related To Google DeepMind Research Engineer - Gemini Pretraining

Research Engineer

Google DeepMind

Research Engineer position at Google DeepMind working on cutting-edge ML models and AI technologies in Seattle.

Software Engineer, LLM Pre-Training Optimization

Google DeepMind

Senior Software Engineer role at Google DeepMind focusing on optimizing pre-training efficiency for large language models using TPUs and custom kernels.

Research Engineer

Google DeepMind

Senior Research Engineer position at Google DeepMind working on Gemini embedding and multimodal AI models

Software Engineer - Health Agent

Google DeepMind

Senior Software Engineer role developing next-generation AI health agents using Astra and Gemini technologies at Google DeepMind.

Software Engineer

Google DeepMind

Senior Software Engineer role at Google DeepMind focusing on AI agent development and implementation, requiring 5+ years of experience in software development.