28 Sept 2025
56m

From GPUs-as-a-Service to Workloads-as-a-Service: Flex AI’s Path to High-Utilization AI Infra

Podcast cover

Data Engineering Podcast

In this episode of the Data Engineering Podcast, Tobias Macey interviews Brijesh Tripathi, CEO of Flex AI, about Flex AI, a platform offering a service-oriented abstraction for AI workloads. Brijesh discusses the challenges small teams face in setting up and maintaining infrastructure for AI applications, leading them to become DevOps experts instead of focusing on their core problems. He explains how Flex AI simplifies access to compute, reduces cost unpredictability, and provides a consistent Kubernetes layer. The conversation covers the complexities of GPU-heavy workloads, the shift towards inference, and the importance of workload orchestration. Brijesh emphasizes Flex AI's ability to optimize for training time, manage experimentation loops, and deploy models across multiple clouds and architectures, ultimately enabling founders to concentrate on their business objectives rather than infrastructure management.

Outlines

Part 1: Introduction and Background

Part 2: Infrastructure Challenges and Solutions

Part 3: Flex AI's Approach and Technology

Part 4: User Experience and Applications

Part 5: Lessons, Customer Profile, and Future

Part 6: Conclusion and Outlook

Sign in to continue reading, translating and more.

Open full episode in Podwise