Presentation: AI Agents to Make Sense of Data at OpenAI

2026-06-19 · Source: InfoQ · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Software Development & Engineering · Depth: Advanced, extended

Summary

OpenAI's Bonnie Xu presented Kepler, an internal AI data analyst agent designed to query over 600 petabytes of data across 70,000 datasets. Kepler addresses challenges like table discovery and complex SQL generation by leveraging a Multi-tool Co-optimization Process (MCP), automated code crawling for table metadata, and Retrieval Augmented Generation (RAG) for company context. The system also incorporates scoped semantic memory for continuous self-learning and utilizes AST-based LLM grading in its evaluation pipeline to ensure accuracy and prevent regressions. Kepler is available 24/7 via Slack, UI, or IDE, enabling users to ask complex data questions and receive detailed analyses, including charts and anomaly debugging.

Key takeaway

For MLOps Engineers building internal data tools, OpenAI's Kepler demonstrates a robust architecture for AI agents handling massive datasets. You should prioritize integrating comprehensive context beyond basic metadata, implement scoped memory for continuous learning, and establish AST-based LLM grading for evaluation to ensure accuracy and prevent regressions in complex query generation. This approach significantly boosts data productivity and user trust.

Key insights

OpenAI's Kepler agent uses advanced AI techniques to automate complex data analysis across vast datasets, improving data accessibility and accuracy.

Principles

Context beyond table metadata is crucial.
Memory enables continuous agent learning.
Robust evals prevent model regression.

Method

Kepler employs MCP for interactive data exploration, automated code crawling for fresh table metadata, RAG for company context, and scoped semantic memory for corrections, all evaluated via AST-based LLM grading.

In practice

Query 600+ petabytes via natural language.
Debug data anomalies automatically.
Generate complex SQL and charts.

Topics

AI Agents
Data Analysis
Large Language Models
Retrieval-Augmented Generation
Data Platforms
SQL Generation
Evaluation Metrics

Best for: AI Architect, CTO, VP of Engineering/Data, AI Engineer, Machine Learning Engineer, MLOps Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by InfoQ.