ArXiv Preprint - De-Diffusion Makes Text a Strong Cross-Modal Interface | AI Breakdown | Podwise