LessWrong (30+ Karma) - “Dodging systematic human errors in scalable oversight” by Benjamin Hilton, Geoffrey Irving
Sign in to continue reading, translating and more.