LessWrong (30+ Karma) - “Paper: Prompt Optimization Makes Misalignment Legible” by Caleb Biddulph, micahcarroll
Sign in to continue reading, translating and more.