The Nonlinear Library - LW - Meta-level adversarial evaluation of oversight techniques might allow robust measurement of their adequacy by Buck
Sign in to continue reading, translating and more.