23 Jan 2025
45m

Video generation with realistic motion

Podcast cover

Practical AI

This interview podcast episode of Practical AI focuses on the advancements and challenges in video generation AI. The hosts discuss the evolution of video generation models, from early GANs to the current state-of-the-art diffusion models like OpenAI's Sora. The guest, Paras Jain, CEO of Genmo, details the complexities of training these models, highlighting the massive data requirements and the difficulty in achieving realistic motion and prompt adherence. Genmo's open-sourced Mochi model is presented as a significant contribution, aiming to improve motion and prompt following, and the podcast explores how this technology is impacting content creation workflows, including editing capabilities demonstrated by community-built tools like Mochi Edit. The discussion concludes with a vision for the future of video generation, emphasizing its potential to democratize content creation and empower creators globally.

Outlines

Part 1: Introduction to Video Generation

Part 2: Challenges and Evaluation

Part 3: Genmo's Model Development

Part 4: Applications and Future Impact

Sign in to continue reading, translating and more.

Open full episode in Podwise