“Do Models Continue Misaligned Actions?” by Jordan Taylor | LessWrong (30+ Karma) | Podwise