“Reporting Tasks as Reward-Hackable: Better Than Inoculation Prompting?” by RogerDearnaley | LessWrong (30+ Karma) | Podwise