LW - Gated Attention Blocks: Preliminary Progress toward Removing Attention Head Superposition by cmathw | The Nonlinear Library | Podwise