Scaling Stick-Breaking Attention: An Efficient Implementation and In-depth Study | Xiaol.x | Podwise