Prompt Management, Tracing, and Evals: The New Table Stakes for GenAI Ops

The podcast explores the operational challenges and blind spots teams face when deploying AI models, particularly LLMs, in production. Aman Agarwal, builder of OpenLit, an AI engineering tool, highlights key issues such as understanding AI response mechanisms, managing token usage costs, and prompt management. Agarwal emphasizes the need for observability and monitoring to understand AI behavior and optimize performance. The conversation covers tools like LangSmith, LangFuse, and TensorZero, discussing the importance of open-source, vendor-agnostic solutions for AI development. OpenLit's architecture, built on OpenTelemetry, aims to provide detailed traces and insights into AI workflows, aiding in debugging and optimization. The podcast further explores experimentation, evaluation, and the significance of context management for improving AI app performance and reliability.

Outlines

Part 1: Introduction, AI Development Challenges

Part 2: OpenLit Features, Architecture, and Integration

Part 3: Optimization, Standards, and Reliability

Part 4: Evolution, Security, and Practical Insights

Part 5: Community, Lessons, and Future Outlook

Sign in to continue reading, translating and more.

Open full episode in Podwise

Data Engineering Podcast

Part 1: Introduction, AI Development Challenges

Democratizing Data Access with Retool for AI Model Optimization

AI Development Blind Spots: Prompt Interception, Cost, and Management

The Need for Observability and Open Source Tools in AI App Development

Part 2: OpenLit Features, Architecture, and Integration

OpenLit's Open-Source Approach to AI Tooling and MVP Creation

OpenLit's Fleet Hub and Kubernetes Operator for AI App Management

Experimentation and Evaluation Features in OpenLit for LLM Optimization

OpenLit's Integration with Existing Platforms and Detailed Trace Information

Part 3: Optimization, Standards, and Reliability

Observability for Confident Model Selection and Cost Optimization

OpenTelemetry-Based Design and Extensibility of OpenLit

Reliability and Safe Defaults in OpenLit's Prompt Management

Part 4: Evolution, Security, and Practical Insights

Evolution of OpenLit's Scope and Goals: From Telemetry to AI Engineering Tool

OpenLit's Platform Approach and Focus on Data Security

Leveraging OpenLit Insights for Self-Improvement in Agentic Use Cases

Sharp Edges and Blind Spots in LLM Operations Despite Observability

Part 5: Community, Lessons, and Future Outlook

Surprising Applications and Community Adoption of OpenLit

Lessons Learned: Community Engagement and Prioritizing Developer Experience

When OpenLit Might Not Be the Right Choice

Future Focus: Context Management and Emotional Intelligence in AI

Closing Remarks and Other Podcast Recommendations

Prompt Management, Tracing, and Evals: The New Table Stakes for GenAI Ops

Data Engineering Podcast

Part 1: Introduction, AI Development Challenges

00:11Democratizing Data Access with Retool for AI Model Optimization

Democratizing Data Access with Retool for AI Model Optimization

01:14AI Development Blind Spots: Prompt Interception, Cost, and Management

AI Development Blind Spots: Prompt Interception, Cost, and Management

05:09The Need for Observability and Open Source Tools in AI App Development

The Need for Observability and Open Source Tools in AI App Development

Part 2: OpenLit Features, Architecture, and Integration

09:47OpenLit's Open-Source Approach to AI Tooling and MVP Creation

OpenLit's Open-Source Approach to AI Tooling and MVP Creation

12:33OpenLit's Fleet Hub and Kubernetes Operator for AI App Management

OpenLit's Fleet Hub and Kubernetes Operator for AI App Management

15:55Experimentation and Evaluation Features in OpenLit for LLM Optimization

Experimentation and Evaluation Features in OpenLit for LLM Optimization

18:29OpenLit's Integration with Existing Platforms and Detailed Trace Information

OpenLit's Integration with Existing Platforms and Detailed Trace Information

Part 3: Optimization, Standards, and Reliability

22:46Observability for Confident Model Selection and Cost Optimization

Observability for Confident Model Selection and Cost Optimization

24:41OpenTelemetry-Based Design and Extensibility of OpenLit

OpenTelemetry-Based Design and Extensibility of OpenLit

28:22Reliability and Safe Defaults in OpenLit's Prompt Management

Reliability and Safe Defaults in OpenLit's Prompt Management

Part 4: Evolution, Security, and Practical Insights

31:24Evolution of OpenLit's Scope and Goals: From Telemetry to AI Engineering Tool

Evolution of OpenLit's Scope and Goals: From Telemetry to AI Engineering Tool

34:04OpenLit's Platform Approach and Focus on Data Security

OpenLit's Platform Approach and Focus on Data Security

36:14Leveraging OpenLit Insights for Self-Improvement in Agentic Use Cases

Leveraging OpenLit Insights for Self-Improvement in Agentic Use Cases

38:18Sharp Edges and Blind Spots in LLM Operations Despite Observability

Sharp Edges and Blind Spots in LLM Operations Despite Observability

Part 5: Community, Lessons, and Future Outlook

39:45Surprising Applications and Community Adoption of OpenLit

Surprising Applications and Community Adoption of OpenLit

42:20Lessons Learned: Community Engagement and Prioritizing Developer Experience

Lessons Learned: Community Engagement and Prioritizing Developer Experience

44:19When OpenLit Might Not Be the Right Choice

When OpenLit Might Not Be the Right Choice

46:42Future Focus: Context Management and Emotional Intelligence in AI

Future Focus: Context Management and Emotional Intelligence in AI

50:03Closing Remarks and Other Podcast Recommendations

Closing Remarks and Other Podcast Recommendations