In this episode of the Kubernetes Podcast from Google, hosts Kaslin Fields and Mofi Rahman interview Clayton Coleman and Rob Shaw about running large language models (LLMs) on Kubernetes. The discussion covers why LLMs present unique challenges compared to traditional web applications, focusing on resource usage, scale, and the need for specialized load balancing. The guests delve into the role of projects like the Inference Gateway and vLLM in optimizing LLM serving, and how llm-d aims to integrate these tools to create well-lit paths for production-grade inference applications. They also discuss the importance of open-source collaboration in this rapidly evolving field and speculate on the future of AI model serving on Kubernetes, highlighting the shift toward open innovation, hardware advancements, and the rise of agentic applications. The episode also touches on the Kubernetes 1.34 release, the KubeCrash event, and the CNCF's top open-source projects.
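For listeners unfamiliar with the serving stack the guests describe, here is a minimal sketch of calling a vLLM server through its OpenAI-compatible API; the endpoint URL, model name, and API key are placeholder assumptions for illustration, not details from the episode.

```python
# Minimal sketch: querying a vLLM server via its OpenAI-compatible API.
# Assumptions (not from the episode): a vLLM server is already running at
# http://localhost:8000 and serving a model registered as "my-model".
from openai import OpenAI

# vLLM exposes an OpenAI-compatible endpoint, so the standard client works;
# the api_key value is unused by a local server but required by the client.
client = OpenAI(base_url="http://localhost:8000/v1", api_key="unused")

response = client.chat.completions.create(
    model="my-model",
    messages=[
        {"role": "user", "content": "Why do LLMs need specialized load balancing?"}
    ],
)
print(response.choices[0].message.content)
```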