I built an AI data investigator that refuses to make up numbers

· Source: Data Science on Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Intermediate, medium

Summary

Veritas is an open-source MCP server designed to prevent AI tools, specifically Claude, from generating fabricated numerical data in analytical reports. It operates on a "no receipt, no claim" principle, where every number in a final answer must trace back to a verified SQL query or Python script result. The system treats complex questions as research projects, generating competing explanations and using deterministic Python code to validate all numerical claims against saved "receipts." Veritas also features a discovery mode that aggressively filters potential findings using Benjamini-Hochberg false-discovery-rate control and an effect-size floor, prioritizing silence over statistical noise. It is deliberately narrow, working with local CSV, Parquet, and Excel files via DuckDB, and includes a public evaluation suite for rigor.

Key takeaway

For data scientists or AI engineers building analytical agents, Veritas offers a critical blueprint for ensuring numerical accuracy and trustworthiness. You should integrate deterministic verification steps, like those in Veritas, into your AI-driven data investigation workflows to prevent hallucinated numbers. Consider adopting its "receipt" system and aggressive filtering for discovery modes to build more reliable and accountable AI tools, especially when dealing with sensitive business metrics.

Key insights

AI data investigation must enforce numerical veracity through verifiable query results, not just plausible language.

Principles

Method

Claude splits questions into hypotheses, runs queries, and saves results as artifacts. A deterministic Python step then verifies each reported number against these artifacts.

In practice

Topics

Code references

Best for: AI Architect, Machine Learning Engineer, AI Product Manager, AI Engineer, MLOps Engineer, Data Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Data Science on Medium.