09 Jan 2026
9m
“Alignment Faking is a Linear Feature in Anthropic’s Hughes Model” by James Hoffend
LessWrong (30+ Karma)

