“Contrastive features elicit different perturbation responses than SAE features” by Francisco Ferreira da Silva, StefanHex | LessWrong (30+ Karma) | Podwise