22 May 2025
2h 24m

Is RL + LLMs enough for AGI? — Sholto Douglas & Trenton Bricken

Podcast cover

Dwarkesh Podcast

In this panel discussion, Dwarkesh, Sholto Douglas, and Trenton Bricken delve into the advancements and future of AI, particularly focusing on reinforcement learning (RL) and its applications in software engineering, computer use, and scientific research. They discuss the importance of feedback loops, the challenges of aligning AI with human values, and the potential for AI to automate white-collar jobs. The conversation covers topics such as the limitations of current models, the role of compute and data, the significance of mechanistic interpretability, and the ethical considerations surrounding increasingly capable AI systems, including the potential for economic disruption and the need for proactive policy-making. The panelists also make predictions about the capabilities of AI agents in the near future and offer advice for those looking to enter the field.

Outlines

Part 1: Introduction and Foundations

Part 2: Techniques and Model Properties

Part 3: Safety and Alignment

Part 4: Evaluation and Interpretability

Part 5: Future Outlook and Research

Sign in to continue reading, translating and more.

Open full episode in Podwise