“Backdoors as an analogy for deceptive alignment” by Jacob_Hilton | LessWrong (30+ Karma) | Podwise