LessWrong (30+ Karma) - “ImpossibleBench: Measuring Reward Hacking in LLM Coding Agents” by Ziqian Zhong
Sign in to continue reading, translating and more.