Microsoft Research - Synchronized Audio-Visual Generation with a Joint Generative Diffusion Model and Contrastive Loss
Sign in to continue reading, translating and more.