Prompt Management, Tracing, and Evals: The New Table Stakes for GenAI Ops
Data Engineering Podcast
The podcast explores the operational challenges and blind spots teams face when deploying AI models, particularly LLMs, in production. Aman Agarwal, builder of OpenLit, an AI engineering tool, highlights key issues such as understanding AI response mechanisms, managing token usage costs, and prompt management. Agarwal emphasizes the need for observability and monitoring to understand AI behavior and optimize performance. The conversation covers tools like LangSmith, LangFuse, and TensorZero, discussing the importance of open-source, vendor-agnostic solutions for AI development. OpenLit's architecture, built on OpenTelemetry, aims to provide detailed traces and insights into AI workflows, aiding in debugging and optimization. The podcast further explores experimentation, evaluation, and the significance of context management for improving AI app performance and reliability.
Part 1: Introduction, AI Development Challenges
Part 2: OpenLit Features, Architecture, and Integration
Part 3: Optimization, Standards, and Reliability
Part 4: Evolution, Security, and Practical Insights
Part 5: Community, Lessons, and Future Outlook
Sign in to continue reading, translating and more.
Open full episode in Podwise