16 Oct 2025
1h 16m

How GPT-5 Thinks — OpenAI VP of Research Jerry Tworek

Podcast cover

The MAD Podcast with Matt Turck

In this episode of The MAD Podcast, Matt Turck interviews Jerry Tworek, VP of Research at OpenAI, about the evolution of AI reasoning models. They discuss the concept of reasoning in AI, particularly the "chain of thought" process, and how models decide how long to "think" to balance result quality and user experience. Tworek shares insights into the development of OpenAI's reasoning models, including O1, O3, and GPT-5, and his journey from mathematics and trading to becoming a leading AI researcher. He also provides a behind-the-scenes look at OpenAI's research culture, project prioritization, and the balance between collaboration and IP protection. The conversation further explores the roles of pre-training and reinforcement learning (RL) in creating modern AI systems, with a detailed explanation of RL principles and its application in various domains, including math, coding, and general problem-solving.

Outlines

Part 1: Reasoning and AI Models

Part 2: Pre-training and Reinforcement Learning

Part 3: The Future of AGI

Sign in to continue reading, translating and more.

Open full episode in Podwise