02 Oct 2025
1h 2m

Inside the Black Box: The Urgency of AI Interpretability

Podcast cover

Generative Now | AI Builders on Creating the Future

This podcast episode of "Generative Now," hosted by Michael Mignano and Namdi Regbulam, features a live discussion on AI interpretability with Jack Lindsey from Anthropic and Tom McGrath, co-founder of GoodFire. The conversation explores the increasing importance and urgency of understanding the internal mechanisms of AI models to ensure their safety, reliability, and usefulness. The speakers discuss the technical challenges in achieving interpretability, the potential for AI to assist in the interpretability process, and real-world applications of interpretability in healthcare and other industries. They also touch on potential breakthrough moments in the field, such as building reliable lie detectors for language models and extracting new scientific knowledge from AI models.

Outlines

Part 1: Introduction and Defining Interpretability

Part 2: Urgency, Challenges, and Scaling

Part 3: Applications and Future Directions

Sign in to continue reading, translating and more.

Open full episode in Podwise