YouTube · 09 Apr 2026
58m

OpenAI’s Chief Scientist on Continual Learning Hype, RL Beyond Code, & Future Alignment Directions

Unsupervised Learning: Redpoint's AI Podcast

The podcast explores the progress and future of AI research, focusing on achieving research-intern-level AI and fully automated AI researchers. It highlights the explosive growth of coding tools like Codex and advances in math and physics capabilities, which serve as benchmarks for improving AI reasoning. The conversation covers the challenges of evaluating AI progress in domains like medicine and law, emphasizing the importance of models assessing their own partial progress and the need for longer-term consistency. It also addresses whether companies should invest in reinforcement learning or rely on contextual learning, and the potential for AI to collaborate with humans in scientific research. The guest shares insights on AI safety, particularly the use of chain-of-thought monitoring to understand model motivations and generalization.

Outlines

Part 1: AI Research Goals, Capabilities

Part 2: Reasoning, Benchmarks, RL

Part 3: Product, Interface, Research Strategy

Part 4: Learning, Environment, Science

Part 5: Safety, Alignment, Monitoring

Part 6: Organization, Robotics, Society
