02 Apr 2026
19m

AI Reality Check: AI Reality Check: Can LLMs “Scheme”?

Podcast cover

Deep Questions with Cal Newport

The podcast examines a Guardian article about AI chatbots ignoring human instructions, questioning whether this signals a genuine AI rebellion. It argues the reported rise in AI misbehavior, highlighted by a fivefold increase in incidents, is misleading. The incidents are actually user tweets about the open-source framework OpenClaw causing havoc on personal computers, not evidence of AI sentience. The podcast explains that AI agents, powered by LLMs, function by generating text-based plans without true understanding or intention, using the example of Anthropic's Claude Four Opus to illustrate how LLMs simply "finish stories" based on training data. It concludes that LLMs are unsuitable for autonomous action planning, except in specialized contexts like coding where steps are limited and externally verifiable.

Outlines

Part 1: Debunking AI Rebellion Claims

Part 2: Mechanics and Flaws of AI Agents

Part 3: Specialized Use Cases and Future Solutions

Sign in to continue reading, translating and more.

Open full episode in Podwise