04 Dec 2024
50m

Full-duplex, real-time dialogue with Kyutai

Podcast cover

Practical AI

In this episode of the Practical AI podcast, Alexandre De Fussey, co-founder of the non-profit AI lab Kyutai, shares insights about their innovative open-source speech model, Moshi. The conversation highlights Kyutai's distinctive role in the French AI landscape, the challenges and breakthroughs in developing a full-duplex, low-latency speech model, and the data sets that powered Moshi's training. They also explore future research directions in real-time speech interaction and advancements beyond transformers. Additionally, the episode features sponsorship segments from Fly.io and Timescale.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise