“Training a Reward Hacker Despite Perfect Labels” by ariana_azarbal, vgillioz, TurnTrout | LessWrong (30+ Karma) | Podwise