LessWrong (30+ Karma) - “Prefill awareness: can LLMs tell when “their” message history has been tampered with?” by David Africa