Recursion at inference time offers a viable alternative to scaling model size for improving AI reasoning. Unlike standard LLMs, which rely on a single feed-forward pass, Hierarchical Reasoning Models (HRM) and Tiny Recursive Models (TRM) apply iterative computation to tasks that resist one-shot solutions, such as Sudoku and sorting. By combining a deep-equilibrium formulation with truncated backpropagation through time, these models treat their latent states as a memory cache, enabling complex reasoning without the prohibitive memory costs of massive transformer architectures. TRM goes further by collapsing HRM's hierarchy of networks into a single shared-weight module, achieving stronger results on benchmarks such as the ARC Prize with far fewer parameters. Integrating these compute-efficient recursive methods into general-purpose models is a promising route around the inherent reasoning limits of current token-based architectures.
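The core mechanism can be illustrated with a minimal sketch: one small shared-weight module applied repeatedly to a latent state until it settles, deep-equilibrium style, into a fixed point. This is an illustrative toy, not HRM's or TRM's actual architecture; the dimensions, the contraction scaling, and the `step`/`recurse` helpers are all assumptions chosen so the recursion provably converges.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes for illustration only.
D = 8    # latent width
K = 50   # number of recursive refinement steps

# A single shared-weight "reasoning" module, applied repeatedly
# (TRM-style weight sharing). Rescaling W to spectral norm 0.5 makes
# the update a contraction, so the latent state converges to a fixed
# point as in deep-equilibrium models.
W = rng.normal(size=(D, D))
W *= 0.5 / np.linalg.norm(W, 2)
b = rng.normal(size=D)

def step(z, x):
    """One recursion: refine latent state z conditioned on input x."""
    return np.tanh(W @ z + x + b)

def recurse(x, k=K):
    """Iterate the shared module; z acts as the evolving memory cache."""
    z = np.zeros(D)
    for _ in range(k):
        z = step(z, x)
    return z

x = rng.normal(size=D)
z_star = recurse(x)
# At a fixed point, one more step barely changes z.
residual = np.linalg.norm(step(z_star, x) - z_star)
print(residual < 1e-6)  # → True
```

In the full method, gradients are taken only through the last step or two of this loop (truncated backpropagation through time), which is what keeps training memory flat no matter how many recursion steps run at inference.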