“How to Design Environments for Understanding Model Motives” by gersonkroiz, aditya singh, Senthooran Rajamanoharan, Neel Nanda | LessWrong (30+ Karma) | Podwise