“Toy models of AI control for concentrated catastrophe prevention” by Fabien Roger, Buck | LessWrong (30+ Karma) | Podwise