LessWrong (30+ Karma) - “SAEBench: A Comprehensive Benchmark for Sparse Autoencoders” by Can, Adam Karvonen, Johnny Lin, Curt Tigges, Joseph Bloom, chanind, Yeu-Tong Lau, Eoin Farrell, Arthur Conmy, CallumMcDougall, Kola Ayonrinde, Matthew Wearden, Sam Marks, Neel Nanda