This episode of "Generative Now," hosted by Michael Mignano and Nnamdi Iregbulem, features a live discussion on AI interpretability with Jack Lindsey of Anthropic and Tom McGrath, co-founder of Goodfire. The conversation explores why understanding the internal mechanisms of AI models is becoming more important and more urgent for ensuring that those models are safe, reliable, and useful. The speakers discuss the technical challenges of achieving interpretability, the potential for AI itself to assist in interpretability work, and real-world applications in healthcare and other industries. They also consider what breakthrough moments in the field might look like, such as building reliable lie detectors for language models and extracting new scientific knowledge from AI models.