“On ‘first critical tries’ in AI alignment” by Joe Carlsmith | LessWrong (30+ Karma) | Podwise