“Weight-sparse transformers have interpretable circuits” by leogao | LessWrong (30+ Karma) | Podwise