Anthropic's Big AI Design Change: The RED Pill
Summary
Anthropic's recent publication, "How Anthropic enables the self-service data analytics with Claude," reports its Claude LLM achieves only 21% accuracy on analytical questions. To improve this, Anthropic employs a "scaffolding" approach. This deterministic, human-defined system of rules, workflows, and data governance boosts accuracy to 95%. The scaffolding includes an "analytical stack" addressing failure modes like incorrect field selection, stale knowledge, and data retrieval issues. It uses structured documentation, explicit instructions, and a mandatory semantic layer to guide the LLM. Analysis by Claude Sonnet 4.6 and ChatGPT indicates this method replaces the LLM's probabilistic reasoning with deterministic logic. This raises questions about the necessity and cost of highly trained LLMs for such tasks.
Key takeaway
For AI Architects and Directors of AI/ML designing data analytics solutions, re-evaluate the value of expensive, highly trained LLMs. If your system's accuracy primarily stems from deterministic scaffolding and human-encoded logic, consider smaller, more cost-effective LLMs. Focus investment on robust scaffolding and clear instruction sets. The intelligence often resides outside the core LLM for these use cases.
Key insights
Deterministic scaffolding, not LLM intelligence, achieves high accuracy in analytical tasks, challenging the value proposition of expensive frontier models.
Principles
- LLMs alone are insufficient for high analytical accuracy.
- Deterministic scaffolding dramatically improves LLM task performance.
- Human-defined logic can override LLM probabilistic reasoning.
Method
Anthropic's method builds a multi-layer analytical stack with sources of truth, validation, and "skills" (deterministic instructions) to guide the LLM in data selection, query execution, and answer verification.
In practice
- Implement a mandatory semantic layer for data queries.
- Conduct adversarial SQL review via sub-agents.
- Encode business logic in LLM-readable markdown.
Topics
- Anthropic Claude
- LLM Scaffolding
- Data Analytics
- AI System Design
- Expert Systems
- Model Accuracy
Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Architect, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Discover AI.