Attention with Trained Embeddings Provably Selects Important Tokens | Xiaol.x | Podwise