“Weight-Sparse Circuits May Be Interpretable Yet Unfaithful” by jacob_drori | LessWrong (30+ Karma) | Podwise