How to do AI analysis you can actually trust
Summary
Caitlin Sullivan, an expert in AI-powered user research, identifies four common failure modes when using large language models (LLMs) like ChatGPT, Claude, and Gemini for analyzing customer data from interviews and surveys. These issues include invented evidence (hallucinations or "Frankenstein quotes"), false or generic insights, signal that doesn't guide decisions, and contradictory insights. Sullivan emphasizes that AI outputs often appear confident even when flawed, leading to misguided product decisions. She highlights that LLMs struggle with unstructured interview data by oversimplifying and with survey data by misinterpreting sparse responses or irrelevant metadata. The article also compares LLMs, recommending Claude for thorough analysis, Gemini for evidenced themes, and ChatGPT for final framing, noting ChatGPT's higher propensity for the discussed failure modes.
Key takeaway
For Product Managers and Research Scientists analyzing customer feedback with AI, you must implement rigorous prompting and verification steps to avoid critical errors. Define precise quote rules and use a secondary verification prompt to confirm the authenticity of AI-generated evidence. This prevents acting on invented or generic insights, ensuring your product decisions are grounded in actual customer voice rather than confident AI hallucinations.
Key insights
AI analysis of customer data often yields confident but flawed insights, requiring specific prompting techniques for reliability.
Principles
- AI defaults to finding consensus, losing valuable edge cases.
- LLMs generate text, not retrieve quotes like a search engine.
- Verification is crucial to prevent AI-driven confirmation bias.
Method
Define explicit quote rules (start/end, content, citation) and then use a verification prompt to confirm quote existence and accuracy against source transcripts, flagging paraphrases or unlocated text.
In practice
- Use Claude for in-depth customer data analysis.
- Define clear project, business, and product context for LLMs.
- Verify all AI-generated quotes against original source material.
Topics
- AI User Research
- Prompt Engineering
- LLM Analysis
- AI Hallucinations
- Qualitative Data Analysis
Best for: Product Manager, Research Scientist, AI Data Scientist
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Lenny's Newsletter.