08 Dec 2023
11m
“Refusal mechanisms: initial experiments with Llama-2-7b-chat” by andyrdt, Oscar Balcells Obeso
LessWrong (30+ Karma)
Open in Podwise to generate AI notes
Sign in to process this episode and unlock summaries, transcripts, highlights and translations.
Shownotes are not generated by Podwise.
