How to Run Evals in Claude Code with Aparna Dhinakaran, Founder and CPO of Arize

Product management in the AI era centers on cultivating "product taste" by building iterative loops that transform user feedback into actionable agent improvements. Modern AI PMs must bridge the technical gap by utilizing terminal-based tools like Claude Code to build, instrument, and evaluate agents directly. Observability through tracing is essential for this process, as it provides the granular data needed to identify performance failures and refine evaluation metrics. Rather than relying on static roadmaps, successful teams implement self-improving cycles where agents analyze their own traces to prioritize bugs and feature requests. Aparna Dhinakaran, CPO and co-founder of Arize AI, emphasizes that the most effective PMs treat these data-driven feedback loops as a foundational layer, enabling them to ship high-impact solutions at unprecedented velocity while maintaining rigorous standards for accuracy and alignment.

Outlines

Sign in to continue reading, translating and more.

Open full episode in Podwise

The Growth Podcast

Cultivating Product Taste with AI-Driven PM Agents

Instrumenting Agents for Real-Time Observability

Establishing Baseline Evals for Priority Accuracy

Architecting Self-Improving Agent Workflows

The Evolving Profile of the AI-Native Product Manager

Enterprise Roadmap for AI Integration and Data Context

How to Run Evals in Claude Code with Aparna Dhinakaran, Founder and CPO of Arize

The Growth Podcast

00:00Cultivating Product Taste with AI-Driven PM Agents

Cultivating Product Taste with AI-Driven PM Agents

13:15Instrumenting Agents for Real-Time Observability

Instrumenting Agents for Real-Time Observability

23:47Establishing Baseline Evals for Priority Accuracy

Establishing Baseline Evals for Priority Accuracy

43:40Architecting Self-Improving Agent Workflows

Architecting Self-Improving Agent Workflows

56:33The Evolving Profile of the AI-Native Product Manager

The Evolving Profile of the AI-Native Product Manager

1:04:49Enterprise Roadmap for AI Integration and Data Context

Enterprise Roadmap for AI Integration and Data Context