“Reward hacking is becoming more sophisticated and deliberate in frontier LLMs” by Kei | LessWrong (30+ Karma) | Podwise