“A Toy Environment For Exploring Reasoning About Reward” by jenny, Bronson Schoen | LessWrong (30+ Karma) | Podwise