Any-Order GPT as Masked Diffusion Model: Decoupling Formulation and Architecture | Xiaol.x | Podwise