“Tensor-Transformer Variants are Surprisingly Performant” by Logan Riggs | LessWrong (30+ Karma)