Research Engineer, Media Understanding- Multimodal Representation Models

Google DeepMind is a leading AI research company focused on advancing artificial intelligence and its applications across Google products.
$215,000 - $250,000
Machine Learning
Senior Software Engineer
In-Person
5,000+ Employees
5+ years of experience
AI

Description For Research Engineer, Media Understanding- Multimodal Representation Models

Google DeepMind is seeking a Research Engineer to join their Media Understanding team, focusing on advancing state-of-the-art research in Embedding/representation models within the context of large language models. This role presents a unique opportunity to work at the intersection of computer vision, language understanding, and multimodal AI systems.

The position is part of a dynamic team comprising research engineers, scientists, and machine learning experts working towards enabling superhuman understanding of the visual world. The primary focus is on developing the most powerful omnimodal embedding models for retrieval and other agentic use cases in Google products.

As a Research Engineer, you'll be at the forefront of developing models that impact billions of users worldwide. Your responsibilities will include conducting core research in computer vision and language understanding, training and evaluating AI models, and implementing cutting-edge deep learning approaches. The role requires collaboration with various teams to drive innovation and product development.

The ideal candidate should possess either a Ph.D. in Computer Science or related field, or a B.S./M.S. with 5+ years of relevant experience. Strong research experience, publication record, and hands-on experience with Google-scale infrastructure would be advantageous. The position offers competitive compensation ranging from $215,000 to $250,000, plus bonus, equity, and comprehensive benefits.

This role represents an exceptional opportunity to shape the future of multimodal AI while working with world-class researchers and engineers. You'll be contributing to groundbreaking research that directly impacts how Google products understand and interact with diverse media formats, including text, images, audio, and video.

Join Google DeepMind's commitment to diversity and innovation, where your unique perspective will contribute to creating extraordinary impact in the field of artificial intelligence and machine learning.

Last updated a month ago

Responsibilities For Research Engineer, Media Understanding- Multimodal Representation Models

  • Conducting core research in computer vision, language understanding, multimodal models, and large scale AI models
  • Training and evaluating AI models for various product use cases
  • Implementing and adapting state of the art deep learning approaches
  • Collaborating with GDM and partner teams to build advanced embedding models
  • Transform prototypes into scalable solutions for Google's products

Requirements For Research Engineer, Media Understanding- Multimodal Representation Models

Python
  • Ph.D. in Computer Science or related field, or B.S./M.S. with 5+ years of relevant experience
  • Experience in machine learning models and techniques
  • Ability to transform prototypes into scalable solutions
  • Research and problem-solving capabilities
  • Strong collaboration skills
  • Experience with core software engineering and applied AI implementations
  • Publication record in top tier conferences (preferred)
  • Experience with Google-scale infrastructure (preferred)

Benefits For Research Engineer, Media Understanding- Multimodal Representation Models

Medical Insurance
401k
  • Bonus
  • Equity
  • Benefits Package

Interested in this job?

Jobs Related To Google DeepMind Research Engineer, Media Understanding- Multimodal Representation Models

Research Engineer

Research Engineer position at Google DeepMind working on cutting-edge ML models and AI technologies in Seattle.

Software Engineer, LLM Pre-Training Optimization

Senior Software Engineer role at Google DeepMind focusing on optimizing pre-training efficiency for large language models using TPUs and custom kernels.

Research Engineer

Senior Research Engineer position at Google DeepMind working on Gemini embedding and multimodal AI models

Software Engineer - Health Agent

Senior Software Engineer role developing next-generation AI health agents using Astra and Gemini technologies at Google DeepMind.

Software Engineer

Senior Software Engineer role at Google DeepMind focusing on AI agent development and implementation, requiring 5+ years of experience in software development.