LessWrong (30+ Karma) - “Weight-sparse transformers have interpretable circuits” by leogao
Sign in to continue reading, translating and more.