06 Feb 2025
4m
“Detecting Strategic Deception Using Linear Probes” by Nicholas Goldowsky-Dill, bilalchughtai, StefanHex, Marius Hobbhahn
LessWrong (30+ Karma)

