“Towards Multimodal Interpretability: Learning Sparse Interpretable Features in Vision Transformers” by hugofry | LessWrong (30+ Karma) | Podwise