Why Not Just Train For Interpretability? | LessWrong (30+ Karma) | Podwise