YouTube27 Jun 2025
53m

From Mixture of Experts to Mixture of Agents with Super Fast Inference - Daniel Kim & Daria Soboleva

Podcast cover

AI Engineer

The podcast is a workshop and Q&A session led by Daria Soboleva and Daniel Kim from Cerebras. They introduce the concept of Mixture of Experts and Mixture of Agents, explaining how these architectures improve large language models. The workshop involves a hands-on activity where attendees build an application using Mixture of Agents to generate and optimize Python code. The session includes a challenge to achieve the highest score on an automated grader, followed by a Q&A where the hosts answer questions about Cerebras hardware, model onboarding, power consumption, and the application of Mixture of Agents.

Outlines

Part 1: Workshop Introduction and Cerebras Overview

Part 2: Mixture of Agents and Cerebras Advantage

Part 3: Hands-On Workshop and AI Challenge

Part 4: Q&A and Future Directions

Sign in to continue reading, translating and more.

Open full episode in Podwise