Causal Interpretation of Transformer Self-Attention | Best AI papers explained | Podwise