How GPT-5 Thinks — OpenAI VP of Research Jerry Tworek | The MAD Podcast with Matt Turck

In this episode of The MAD Podcast, Matt Turck interviews Jerry Tworek, VP of Research at OpenAI, about the evolution of AI reasoning models. They discuss the concept of reasoning in AI, particularly the "chain of thought" process, and how models decide how long to "think" to balance result quality and user experience. Tworek shares insights into the development of OpenAI's reasoning models, including O1, O3, and GPT-5, and his journey from mathematics and trading to becoming a leading AI researcher. He also provides a behind-the-scenes look at OpenAI's research culture, project prioritization, and the balance between collaboration and IP protection. The conversation further explores the roles of pre-training and reinforcement learning (RL) in creating modern AI systems, with a detailed explanation of RL principles and its application in various domains, including math, coding, and general problem-solving.

Outlines

Part 1: Reasoning and AI Models

Part 2: Pre-training and Reinforcement Learning

Part 3: The Future of AGI

Sign in to continue reading, translating and more.

Continue

How GPT-5 Thinks — OpenAI VP of Research Jerry Tworek

The MAD Podcast with Matt Turck

Part 1: Reasoning and AI Models

Introduction to Reasoning in AI Models

Balancing Reasoning Time and User Experience in AI

Jerry Tworek's Early Life and Path to AI

Discovering Reinforcement Learning and Joining OpenAI

A Day in the Life at OpenAI and Research Priorities

OpenAI's Culture of Transparency and Rapid Innovation

Part 2: Pre-training and Reinforcement Learning

Pre-training and Reinforcement Learning in AI Models

The Evolution of Reinforcement Learning and the Role of RLHF

Unsupervised Learning and the GRPO Release

Scaling Reinforcement Learning and the Importance of Automation

Agentic AI, Online RL, and Alignment

AI's Success in Programming Competitions and Generalization of RL

Part 3: The Future of AGI

The Path to AGI and the Role of Pre-training and RL

Concluding Thoughts

How GPT-5 Thinks — OpenAI VP of Research Jerry Tworek

The MAD Podcast with Matt Turck

Part 1: Reasoning and AI Models

00:00Introduction to Reasoning in AI Models

Introduction to Reasoning in AI Models

05:24Balancing Reasoning Time and User Experience in AI

Balancing Reasoning Time and User Experience in AI

10:54Jerry Tworek's Early Life and Path to AI

Jerry Tworek's Early Life and Path to AI

17:20Discovering Reinforcement Learning and Joining OpenAI

Discovering Reinforcement Learning and Joining OpenAI

23:31A Day in the Life at OpenAI and Research Priorities

A Day in the Life at OpenAI and Research Priorities

29:27OpenAI's Culture of Transparency and Rapid Innovation

OpenAI's Culture of Transparency and Rapid Innovation

Part 2: Pre-training and Reinforcement Learning

35:12Pre-training and Reinforcement Learning in AI Models

Pre-training and Reinforcement Learning in AI Models

42:06The Evolution of Reinforcement Learning and the Role of RLHF

The Evolution of Reinforcement Learning and the Role of RLHF

47:52Unsupervised Learning and the GRPO Release

Unsupervised Learning and the GRPO Release

53:01Scaling Reinforcement Learning and the Importance of Automation

Scaling Reinforcement Learning and the Importance of Automation

57:56Agentic AI, Online RL, and Alignment

Agentic AI, Online RL, and Alignment

1:02:30AI's Success in Programming Competitions and Generalization of RL

AI's Success in Programming Competitions and Generalization of RL

Part 3: The Future of AGI

1:09:14The Path to AGI and the Role of Pre-training and RL

The Path to AGI and the Role of Pre-training and RL

1:15:20Concluding Thoughts

Concluding Thoughts