LessWrong (30+ Karma) - “AI Control: Improving Safety Despite Intentional Subversion” by Buck, Fabien Roger, ryan_greenblatt, Kshitij Sachan
Sign in to continue reading, translating and more.