Multi-Cluster Orchestrator, with Nick Eberts and Jon Li

In this episode of the Kubernetes Podcast from Google, recorded live at Google Cloud Next 2025, host Abdel talks with guests Nick Eberts and Jon Li about Multi Cluster Orchestrator (MCO), a new open-source tool announced at KubeCon EU 2025. Jon explains the problem MCO aims to solve: the assumption that the cloud is "infinite and uniform" breaks down with the rise of accelerators like GPUs, which are subject to stockouts and come in varying generations. Nick elaborates on the need for a middle ground between one massive cluster and numerous small ones, highlighting the importance of bin packing applications and of ensuring high availability across regions. He details how MCO helps manage workloads across multiple Kubernetes clusters, especially those running on expensive accelerated hardware: it recommends cluster placement based on priorities and capacity, and it scales workloads from zero to one based on demand. The discussion also covers the Cluster Inventory API and Cluster Profile API, the components MCO builds upon, and how cluster profiles are automatically generated and updated to reflect each cluster's capabilities. Jon discusses how MCO will integrate with the inference extension to create a two-layer approach to routing: first picking a region, then balancing the load among the accelerators within that region.
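
To make the placement idea in the summary concrete, here is a minimal Python sketch of choosing a cluster by priority and available capacity. It is an illustration only, not MCO's actual API or algorithm: the `Cluster` type, its `priority` and `free_gpus` fields, the `recommend_cluster` function, and the cluster names are all hypothetical.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Cluster:
    # All fields are illustrative assumptions, not MCO's real data model.
    name: str
    region: str
    priority: int   # lower number = more preferred
    free_gpus: int  # accelerators currently obtainable in this cluster


def recommend_cluster(clusters: list[Cluster], gpus_needed: int) -> Optional[Cluster]:
    """Return the most preferred cluster that can fit the workload, or None."""
    # Drop clusters that cannot fit the request (e.g. a regional stockout).
    candidates = [c for c in clusters if c.free_gpus >= gpus_needed]
    # Among the remaining clusters, pick the one with the best priority.
    return min(candidates, key=lambda c: c.priority, default=None)


clusters = [
    Cluster("gpu-us-east", "us-east1", priority=1, free_gpus=0),  # stocked out
    Cluster("gpu-us-west", "us-west1", priority=2, free_gpus=8),
    Cluster("gpu-eu-west", "europe-west4", priority=3, free_gpus=16),
]

choice = recommend_cluster(clusters, gpus_needed=4)
print(choice.name if choice else "no capacity")  # prints "gpu-us-west"
```

In this sketch the preferred cluster is skipped because it has no free accelerators, so the workload falls through to the next priority, which is the kind of fallback the episode describes for stockouts.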

Outlines

Kubernetes Podcast from Google

00:00 Introduction to Multi Cluster Orchestrator (MCO) and the Problem It Solves

07:35 MCO's Functionality, Architecture, and Cluster Inventory

15:41 MCO Availability, Integration with Inference Gateways, and Autoscaling

20:06 Conclusion and Future Plans for MCO