25 Sept 2024
28m

Creating Systems that are Safe with Liz Fong-Jones

Podcast cover

Google SRE Prodcast

This episode kicks off Season 3, focusing on the complexities of software design and development within the Site Reliability Engineering (SRE) framework. Host Steve McGhee, co-host Jordan Greenberg, and guest Liz Fong-Jones explore key topics like the difference between observability and monitoring, how improved observability boosts deployment confidence, and the impact of Service Level Objectives (SLOs) on user satisfaction. They emphasize essential principles, such as prioritizing practical solutions over mere hope, understanding the intricacies of distributed systems, and using AI as a supportive tool rather than a substitute for human insight. Their engaging discussion and vivid analogies shine a light on the evolving significance of observability in today's software development landscape.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise