Yann LeCun: World Models: Enabling the next AI revolution

Achieving human-level intelligence requires moving beyond current large language models toward grounded "world models" capable of understanding continuous, high-dimensional, and noisy environments. Intelligence is defined not by the accumulation of declarative knowledge or specific skills, but by the ability to adapt and solve new problems with minimal training. Current generative AI approaches, which focus on pixel-level prediction, fail to capture the underlying structure of the physical world. Instead, the Joint Embedding Predictive Architecture (JEPA) utilizes energy-based models and information maximization to learn abstract representations. This approach enables hierarchical planning and common sense reasoning, allowing systems to predict outcomes and navigate complex tasks safely. By shifting focus from text-based generation to these grounded, predictive architectures, AI can overcome the limitations of current machine learning and move toward more robust, adaptive physical intelligence.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise

Computer Vision and Geometry Group, ETH Zurich

Limitations of Current Machine Learning and the Moravec Paradox

Human Learning Efficiency and the Data Gap

World Model Architecture and Hierarchical Planning

Training Challenges: Generative Models versus Joint Embedding

Abstraction and Practical Implementation of World Models

Yann LeCun: World Models: Enabling the next AI revolution

Computer Vision and Geometry Group, ETH Zurich

00:00Limitations of Current Machine Learning and the Moravec Paradox

Limitations of Current Machine Learning and the Moravec Paradox

06:12Human Learning Efficiency and the Data Gap

Human Learning Efficiency and the Data Gap

12:05World Model Architecture and Hierarchical Planning

World Model Architecture and Hierarchical Planning

20:53Training Challenges: Generative Models versus Joint Embedding

Training Challenges: Generative Models versus Joint Embedding

34:07Abstraction and Practical Implementation of World Models

Abstraction and Practical Implementation of World Models