Training Transformers with Enforced Lipschitz Constants | Xiaol.x | Podwise