LessWrong (30+ Karma) - [Linkpost] “Steering Llama-2 with contrastive activation additions” by Nina Rimsky, Wuschel Schulz, NickGabs, Meg, evhub, TurnTrout
Sign in to continue reading, translating and more.