arXiv preprint - Masked Generative Video-to-Audio Transformers with Enhanced Synchronicity | AI Breakdown | Podwise