Episode cover
YouTube21 May 2026

OpenAI's Yann Dubois: Why AI Progress Suddenly Feels Real

Podcast cover

The MAD Podcast with Matt Turck

Frontier AI development has shifted from optimizing models for verifiable benchmarks like math and coding competitions toward enhancing reliability for messy, real-world utility. Yann Dubois, co-lead of the post-training frontiers team at OpenAI, explains that this transition relies on scaling reinforcement learning to prioritize user productivity over narrow, competition-based tasks. While pre-training provides foundational knowledge, the post-training phase—specifically supervised fine-tuning and reinforcement learning—is essential for aligning models with human intent and improving reasoning efficiency. Despite rapid progress, challenges remain in achieving true continual learning and managing the "last mile" of application-specific integration. Current AI systems demonstrate high initial utility, yet they struggle to evolve alongside enterprise knowledge, highlighting a critical need for better memory and personalization frameworks to move beyond static, harness-dependent implementations.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise