“How Can Interpretability Researchers Help AGI Go Well?” by Neel Nanda | LessWrong (30+ Karma) | Podwise