How to do AI analysis you can actually trust

AI's use in customer data analysis is often unreliable due to outputs filled with errors, invented evidence, and generic insights. Caitlin Sullivan shares techniques for extracting trustworthy user insights from LLMs like ChatGPT, Claude, and Gemini. The core issue lies in AI's tendency to generate plausible but often fabricated quotes and its struggle with unstructured interview data and ambiguous survey responses. To combat invented evidence, Sullivan advises defining strict "quote rules" and verifying quotes in AI analysis. To avoid generic insights, she recommends loading prompts with project, business, product, and participant context. Different LLMs have different strengths: Claude for thorough analysis, Gemini for evidenced themes, and ChatGPT for final framing.

Outlines

Part 1: Problem, Context

Part 2: Data Types, Model Comparison

Part 3: Failure Modes, Practical Fixes

Part 4: Conclusion

Sign in to continue reading, translating and more.

Open full episode in Podwise

Lenny's Reads

Part 1: Problem, Context

The Problem with AI: Confident Outputs Full of Lies in User Research

Trustworthy AI Analysis: Challenging Requests and Segmenting Users by Need

Four Failure Modes of AI Analysis and How to Prevent Them

Part 2: Data Types, Model Comparison

The Challenges of Analyzing Interviews and Surveys with AI

LLM Comparison: Claude, Gemini, and ChatGPT for Customer Data Analysis

Choosing the Right LLM: Claude for Analysis, ChatGPT for Communication

Part 3: Failure Modes, Practical Fixes

Failure Mode 1: Invented Evidence and How to Prevent It

Failure Mode 2: False or Generic Insights and How to Avoid Them

The Fix for Generic Insights: Effective Context Loading in Prompts

Part 4: Conclusion

End of Free Preview

How to do AI analysis you can actually trust

Lenny's Reads

Part 1: Problem, Context

00:00The Problem with AI: Confident Outputs Full of Lies in User Research

The Problem with AI: Confident Outputs Full of Lies in User Research

00:55Trustworthy AI Analysis: Challenging Requests and Segmenting Users by Need

Trustworthy AI Analysis: Challenging Requests and Segmenting Users by Need

02:25Four Failure Modes of AI Analysis and How to Prevent Them

Four Failure Modes of AI Analysis and How to Prevent Them

Part 2: Data Types, Model Comparison

03:30The Challenges of Analyzing Interviews and Surveys with AI

The Challenges of Analyzing Interviews and Surveys with AI

05:36LLM Comparison: Claude, Gemini, and ChatGPT for Customer Data Analysis

LLM Comparison: Claude, Gemini, and ChatGPT for Customer Data Analysis

06:54Choosing the Right LLM: Claude for Analysis, ChatGPT for Communication

Choosing the Right LLM: Claude for Analysis, ChatGPT for Communication

Part 3: Failure Modes, Practical Fixes

08:12Failure Mode 1: Invented Evidence and How to Prevent It

Failure Mode 1: Invented Evidence and How to Prevent It

12:25Failure Mode 2: False or Generic Insights and How to Avoid Them

Failure Mode 2: False or Generic Insights and How to Avoid Them

15:53The Fix for Generic Insights: Effective Context Loading in Prompts

The Fix for Generic Insights: Effective Context Loading in Prompts

Part 4: Conclusion

18:04End of Free Preview

End of Free Preview