18 Oct 2024
12m
“Sabotage Evaluations for Frontier Models” by David Duvenaud, evhub, Joe Benton, Misha Wagner, Eric Christiansen, Ethan Perez, Buck, HoldenKarnofsky, Sam Bowman
LessWrong (30+ Karma)
Open in Podwise to generate AI notes
Sign in to process this episode and unlock summaries, transcripts, highlights and translations.
Shownotes are not generated by Podwise.

