This podcast episode features Hyung Won Chung from OpenAI discussing the pivotal role of scalability in AI research, emphasizing how riding the steady decline in the cost of compute improves AI models. Chung advocates for reframing scaling: rather than simply pouring in more resources, researchers should identify and remove the modeling assumptions that cap performance at larger scales. He explains the success of large language models (LLMs) through the lens of next token prediction as a form of implicit multitask learning. Drawing a distinction between generalist and specialist systems, he highlights the importance of incentive structures and of emergent abilities that appear only at higher scales. Ultimately, Chung encourages the research community to prioritize general skill development in AI systems, paving the way for innovative applications.
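To make the "implicit multitask learning" point concrete, here is a minimal sketch (not from the episode) of how a single next-token objective spans many tasks at once. The corpus lines, the whitespace tokenizer, and the placeholder `toy_model` are all illustrative assumptions; a real LLM would use a learned tokenizer and a neural network conditioned on the context.

```python
import numpy as np

# Toy corpus: each line comes from a different implicit "task" (translation,
# arithmetic, question answering), but no task labels are ever provided --
# the model only ever sees "predict the next token". (Illustrative data.)
corpus = [
    "English: cat -> French: chat",
    "2 + 2 = 4",
    "Q: capital of France ? A: Paris",
]

# Whitespace-token vocabulary over the whole corpus (stand-in for a real tokenizer).
vocab = sorted({tok for line in corpus for tok in line.split()})
tok_to_id = {tok: i for i, tok in enumerate(vocab)}

def toy_model(context_ids, vocab_size):
    """Placeholder model: returns a uniform distribution over the vocabulary.
    A real LLM would condition on context_ids; this stub only marks where
    the prediction happens."""
    return np.full(vocab_size, 1.0 / vocab_size)

def next_token_loss(line):
    """Average cross-entropy of predicting each next token in one line."""
    ids = [tok_to_id[t] for t in line.split()]
    losses = []
    for i in range(1, len(ids)):
        probs = toy_model(ids[:i], len(vocab))
        losses.append(-np.log(probs[ids[i]] + 1e-12))
    return float(np.mean(losses))

# One objective, many implicit tasks: the same loss is computed for every
# kind of text, so lowering it requires getting better at all of them.
for line in corpus:
    print(f"{line!r:40s} next-token loss = {next_token_loss(line):.3f}")
```

The point of the sketch is that nothing in the loss function names a task; the diversity of the data alone turns one objective into many.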