YouTube29 Jan 2026

Ex-OpenAI Researcher On Why He Left, His Honest AGI Timeline, & The Limits of Scaling RL

Podcast cover

Unsupervised Learning: Redpoint's AI Podcast

The podcast explores the current state and future directions of AI, particularly focusing on scaling paradigms, reinforcement learning, and continual learning. Jerry Tworek, former VP of Research at OpenAI, shares his insights on the limitations of current AI models, emphasizing their struggles with generalization and continuous learning from failures. He suggests that while scaling pre-training and RL yields predictable improvements, the key question is whether faster, more data-efficient research methods exist. Tworek also reflects on his time at OpenAI, highlighting pivotal decisions like investing in reasoning models and releasing ChatGPT, while also noting the company's challenges in maintaining focus across multiple domains. The conversation touches on the competitive AI landscape, talent acquisition, and the potential societal impacts of widespread automation.

Outlines

Part 1: Introduction, Scaling, and RL Foundations

Part 2: Technical Challenges, Generalization, and AGI

Part 3: OpenAI Insights and Strategic Shifts

Part 4: The Coding Landscape and Market Competition

Part 5: Talent, Research Culture, and Future Outlook

Sign in to continue reading, translating and more.

Open full episode in Podwise