LessWrong (30+ Karma) - “2025-Era “Reward Hacking” Does Not Show that Reward Is the Optimization Target” by TurnTrout
Sign in to continue reading, translating and more.