LessWrong (30+ Karma) - “Feature Targeted LLC Estimation Distinguishes SAE Features from Random Directions” by Lidor Banuel Dabbah, Aviel Boag
Sign in to continue reading, translating and more.