YouTube12 Feb 2026

The AI Frontier: from Gemini 3 Deep Think distilling to Flash — Jeff Dean

Podcast cover

Latent Space

Jeff Dean, Chief AI Scientist at Google, explores the balance between frontier AI model development and practical deployment. The conversation highlights Google's strategy of maintaining both highly capable and affordable models, leveraging techniques like distillation to transfer capabilities from large models to smaller, more efficient ones, such as the Gemini Flash model, which powers AI features across Google products like Search and Gmail. Dean emphasizes the importance of low-latency systems for complex tasks and the role of TPUs in enabling long-context attention operations. He also touches on the shift towards unified models capable of multimodal understanding, including non-human modalities like LiDAR and genomics, and the potential for personalized AI through models that can access and reason over an individual's data.

Outlines

Part 1: Model Strategy, Distillation, and Context

Part 2: Modalities, Search, and System Design

Part 3: Research, Reasoning, and Scaling

Part 4: Future of Engineering

Sign in to continue reading, translating and more.

Open full episode in Podwise