arXiv paper - Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens | AI Breakdown | Podwise