Dwarkesh reflects on a previous interview with Richard Sutton, focusing on Sutton's "Bitter Lesson" essay and its implications for AI development. Dwarkesh interprets Sutton's argument as a call for AI techniques that leverage compute effectively, and as a critique of current LLMs for inefficient training methods, reliance on human data, and producing models that predict human responses rather than developing true world models. Dwarkesh pushes back on the sharp distinction Sutton draws between LLMs and true intelligence, arguing that imitation learning and RL are complementary and that models of humans can serve as a stepping stone toward true world models. He suggests that continual learning could be integrated into LLMs, and that while current LLMs have limitations, they are already undergoing RL on ground truth and are paving the way for future AI systems built on Sutton's principles.