Research Engineer, Media Understanding- Multimodal Representation Models

Google DeepMind

Google DeepMind is a leading AI research company focused on advancing artificial intelligence and its applications across Google products.

San Francisco, CA, USA

$215,000 - $250,000

Machine Learning

Senior Software Engineer

In-Person

5,000+ Employees

5+ years of experience

Description For Research Engineer, Media Understanding- Multimodal Representation Models

Google DeepMind is seeking a Research Engineer to join its Media Understanding team, focusing on advancing state-of-the-art research in embedding/representation models within the context of large language models. This role presents a unique opportunity to work at the intersection of computer vision, language understanding, and multimodal AI systems.

The position is part of a dynamic team comprising research engineers, scientists, and machine learning experts working towards enabling superhuman understanding of the visual world. The primary focus is on developing the most powerful omnimodal embedding models for retrieval and other agentic use cases in Google products.

As a Research Engineer, you'll be at the forefront of developing models that impact billions of users worldwide. Your responsibilities will include conducting core research in computer vision and language understanding, training and evaluating AI models, and implementing cutting-edge deep learning approaches. The role requires collaboration with various teams to drive innovation and product development.

The ideal candidate should possess either a Ph.D. in Computer Science or a related field, or a B.S./M.S. with 5+ years of relevant experience. Strong research experience, a publication record, and hands-on experience with Google-scale infrastructure would be advantageous. The position offers competitive compensation ranging from $215,000 to $250,000, plus bonus, equity, and comprehensive benefits.

This role represents an exceptional opportunity to shape the future of multimodal AI while working with world-class researchers and engineers. You'll be contributing to groundbreaking research that directly impacts how Google products understand and interact with diverse media formats, including text, images, audio, and video.

Google DeepMind is committed to diversity and innovation, and your unique perspective will contribute to creating extraordinary impact in the field of artificial intelligence and machine learning.

Last updated 3 months ago

Responsibilities For Research Engineer, Media Understanding- Multimodal Representation Models

Conducting core research in computer vision, language understanding, multimodal models, and large-scale AI models
Training and evaluating AI models for various product use cases
Implementing and adapting state-of-the-art deep learning approaches
Collaborating with GDM and partner teams to build advanced embedding models
Transforming prototypes into scalable solutions for Google's products

Requirements For Research Engineer, Media Understanding- Multimodal Representation Models

Python

Ph.D. in Computer Science or a related field, or B.S./M.S. with 5+ years of relevant experience
Experience in machine learning models and techniques
Ability to transform prototypes into scalable solutions
Research and problem-solving capabilities
Strong collaboration skills
Experience with core software engineering and applied AI implementations
Publication record in top-tier conferences (preferred)
Experience with Google-scale infrastructure (preferred)

Benefits For Research Engineer, Media Understanding- Multimodal Representation Models

Medical Insurance

401k

Bonus
Equity
Benefits Package

Jobs Related To Google DeepMind Research Engineer, Media Understanding- Multimodal Representation Models

Forward Deployment Engineer, Applied AI

Google DeepMind

Senior Forward Deployment Engineer position at Google DeepMind focusing on developing and deploying novel applications using generative AI models.

Research Engineer

Google DeepMind

Research Engineer position at Google DeepMind working on cutting-edge ML models and AI technologies in Seattle.

Research Engineer

Google DeepMind

Senior Research Engineer position at Google DeepMind working on Gemini embedding and multimodal AI models.