Anthropic's Big AI Design Change: The RED Pill

· Source: Discover AI · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering, Data Science & Analytics · Depth: Expert, extended

Summary

Anthropic's recent publication, "How Anthropic enables the self-service data analytics with Claude," reports its Claude LLM achieves only 21% accuracy on analytical questions. To improve this, Anthropic employs a "scaffolding" approach. This deterministic, human-defined system of rules, workflows, and data governance boosts accuracy to 95%. The scaffolding includes an "analytical stack" addressing failure modes like incorrect field selection, stale knowledge, and data retrieval issues. It uses structured documentation, explicit instructions, and a mandatory semantic layer to guide the LLM. Analysis by Claude Sonnet 4.6 and ChatGPT indicates this method replaces the LLM's probabilistic reasoning with deterministic logic. This raises questions about the necessity and cost of highly trained LLMs for such tasks.

Key takeaway

For AI Architects and Directors of AI/ML designing data analytics solutions, re-evaluate the value of expensive, highly trained LLMs. If your system's accuracy primarily stems from deterministic scaffolding and human-encoded logic, consider smaller, more cost-effective LLMs. Focus investment on robust scaffolding and clear instruction sets. The intelligence often resides outside the core LLM for these use cases.

Key insights

Deterministic scaffolding, not LLM intelligence, achieves high accuracy in analytical tasks, challenging the value proposition of expensive frontier models.

Principles

Method

Anthropic's method builds a multi-layer analytical stack with sources of truth, validation, and "skills" (deterministic instructions) to guide the LLM in data selection, query execution, and answer verification.

In practice

Topics

Best for: Research Scientist, CTO, VP of Engineering/Data, AI Scientist, AI Architect, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Discover AI.