Debugging misaligned completions with sparse-autoencoder latent attribution | Best AI papers explained | Podwise