In this episode of the Practical AI podcast, Alexandre Défossez, co-founder of the non-profit AI lab Kyutai, discusses the lab's open-source speech model, Moshi. The conversation covers Kyutai's role in the French AI landscape, the challenges and breakthroughs in developing a full-duplex, low-latency speech model, and the datasets used to train Moshi. The hosts and their guest also explore future research directions in real-time speech interaction and advances beyond transformers. The episode also includes sponsor segments from Fly.io and Timescale.