YouTube · 20 Dec 2024
1h 46m

Building an ML Platform from scratch

MLOps.community

This episode explores the challenges and potential of building MLOps systems, focusing on the use of SQLMesh, DuckDB, Prefect, and GitHub to manage data transformations and machine learning workflows. The hosts debug issues they hit while connecting models in SQLMesh and stress that linting and documentation help avoid common errors in SQL syntax and time zone handling. They discuss how SQLMesh's state management and virtual environments streamline data pipelines and prevent redundant queries. The conversation then turns to feature engineering, where the hosts debate monolithic versus decoupled pipelines and the role of feature stores in promoting reusability and collaboration among data scientists. They also touch on balancing rapid iteration against code quality, highlighting the need for testing and validation in complex systems.
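
The summary mentions time zone handling as a common source of errors but does not say what the hosts' fix was. As a generic illustration only, not taken from the episode, one frequent cause is mixing naive and timezone-aware timestamps. The sketch below uses only the Python standard library; the helper name `to_utc` and the fixed-offset zone `EST` are hypothetical choices made for this example.

```python
from datetime import datetime, timedelta, timezone


def to_utc(ts: datetime, assumed_tz: timezone = timezone.utc) -> datetime:
    """Return ts as a timezone-aware UTC datetime.

    Naive timestamps carry no zone, so this sketch interprets them
    in assumed_tz (an assumption, not something the episode specifies).
    """
    if ts.tzinfo is None:
        ts = ts.replace(tzinfo=assumed_tz)
    return ts.astimezone(timezone.utc)


# Hypothetical fixed offset used purely for illustration (UTC-5).
EST = timezone(timedelta(hours=-5))

naive = datetime(2024, 12, 20, 9, 0)              # no zone attached
aware = datetime(2024, 12, 20, 9, 0, tzinfo=EST)  # same wall clock, UTC-5

print(to_utc(naive))  # treated as already being in UTC
print(to_utc(aware))  # shifted forward by five hours
```

Normalizing every timestamp to UTC at the pipeline boundary keeps later comparisons and interval filters unambiguous, whichever tool runs the downstream SQL.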

Outline

Part 1: Troubleshooting, Setup

Part 2: SQLMesh UI, State Management

Part 3: Orchestration, Integration

Part 4: Architecture, Best Practices
