Episode cover
22 May 2026
59m

Ep 87: Gemini Co-Lead on World Models, RL's Next Domains & Continual Learning

Podcast cover

Unsupervised Learning with Jacob Effron

Multimodal AI and world models are redefining the frontier of machine learning, particularly following recent advancements showcased at Google I/O. Oriol Vinyals, co-lead of Gemini at Google DeepMind, highlights that world models function as compact, conceptual representations of visual and video data, which could eventually accelerate progress in robotics and physical simulation. While current agentic systems often rely on complex, manually coded scaffolding, future architectures will likely allow models to dynamically generate their own operational structures. Memory management remains a significant hurdle, with the field shifting toward nonparametric, file-system-style storage to handle long-term context. Although scaling and broad distribution training remain the primary drivers of intelligence, post-training on high-difficulty domains like mathematics and coding provides necessary reasoning capabilities. These developments suggest that autonomous, self-improving systems are approaching a critical inflection point in their ability to reason and adapt.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise