LessWrong (30+ Karma) - “How hard is it to inoculate against misalignment generalization?” by Jozdien