This podcast interviews Karan Goel and Albert Gu, co-founders of Cartesia, about their state-space models (SSMs), an alternative to transformer-based architectures for sequence modeling. The discussion covers the development of SSMs, their efficiency advantages, and their strengths on specific data types (such as audio), as well as Cartesia's application of these models in its Sonic text-to-speech engine. A key takeaway is that SSMs scale linearly with sequence length, unlike the quadratic scaling of transformers, enabling faster processing and the potential for on-device deployment. The founders also outline Cartesia's future direction: improving Sonic's real-time capabilities and developing multimodal models for more natural and efficient human-computer interaction.
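To make the linear-vs-quadratic scaling point concrete, here is a minimal NumPy sketch of a discretized linear state-space recurrence. This is an illustrative toy, not Cartesia's actual model; the matrices `A`, `B`, `C` and the `ssm_scan` helper are assumptions for demonstration only.

```python
import numpy as np

def ssm_scan(A, B, C, u):
    """Run a linear SSM, x_t = A x_{t-1} + B u_t, y_t = C x_t, over input u."""
    d = A.shape[0]
    x = np.zeros(d)           # hidden state carries all past context
    ys = []
    for u_t in u:             # one O(d) update per step -> O(L) total,
        x = A @ x + B * u_t   # versus O(L^2) pairwise attention in a transformer
        ys.append(C @ x)
    return np.array(ys)

# Toy usage with random parameters (purely illustrative).
rng = np.random.default_rng(0)
d_state, L = 4, 16
A = 0.9 * np.eye(d_state)             # stable dynamics
B = rng.standard_normal(d_state)
C = rng.standard_normal(d_state)
y = ssm_scan(A, B, C, rng.standard_normal(L))
print(y.shape)  # (16,)
```

Because the state `x` is a fixed-size summary of everything seen so far, generating each new output costs the same regardless of how long the sequence already is, which is the property behind the real-time and on-device claims discussed in the episode.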