LessWrong (30+ Karma) - “On ‘first critical tries’ in AI alignment” by Joe Carlsmith