15 May 2024
1h 35m

John Schulman (OpenAI Cofounder) — Reasoning, RLHF, & plan for 2027 AGI

Podcast cover

Dwarkesh Podcast

This podcast episode delves into various aspects of AI models, including pre-training and post-training, the potential for longer time horizons in task completion, generalization and affordances, the need for coordination and caution in the face of AGI, the importance of establishing limits and monitoring systems, fine-tuning and context learning, the development of ChatGPT, the progress and potential limitations of language models, the scaling law and sample efficiency, keeping humans in the loop in AI-run companies, the challenges of replicability in social sciences, improving chatbot personality, preference models and the existence of a moat in the field, and the potential of AI assistants. The episode provides insights into the capabilities, challenges, and future potential of AI models.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise