LW - You can remove GPT2's LayerNorm by fine-tuning for an hour by StefanHex | The Nonlinear Library | Podwise