“‘Behaviorist’ RL reward functions lead to scheming” by Steven Byrnes | LessWrong (30+ Karma) | Podwise