In this episode of Google's podcast on SRE and production software, hosts Steve McGhee and Jordan Greenberg interview Karanveer Anand, a technical program manager (TPM) with a background in SRE. They discuss the role of TPMs in SRE, highlighting the importance of translating technical concepts into business terms and of managing projects while keeping production services operational. Karanveer shares his experience partitioning software infrastructure to prevent global outages and migrating services to the latest AI models within Workspace, emphasizing the need for tracking, pilot testing, and cross-functional collaboration. The conversation also covers the concept of a "buffer number" for resource planning in SRE and explores how AI can strengthen the relationship between SREs and TPMs, particularly in postmortem analysis and risk assessment.