“Open Source Replication of Anthropic’s Crosscoder paper for model-diffing” by Connor Kissane, robertzk, Arthur Conmy, Neel Nanda | LessWrong (30+ Karma) | Podwise
LessWrong (30+ Karma) - “Open Source Replication of Anthropic’s Crosscoder paper for model-diffing” by Connor Kissane, robertzk, Arthur Conmy, Neel Nanda
Sign in to continue reading, translating and more.