Foundry IQ: Improve recall by up to 54% with knowledge bases

2026-06-02 · Source: Microsoft Foundry Blog articles · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Intermediate, long

Summary

Foundry IQ knowledge bases significantly enhance agentic retrieval performance, improving evidence recall by up to 54% and reducing token costs by 34% compared to standalone retrieval tools. Evaluated on the BrowseComp-Plus benchmark, replacing single-shot RAG with a knowledge base boosts evidence recall by up to 46%. These improvements stem from a dynamic agentic retrieval loop, an enhanced semantic ranker, better answer synthesis, and efficient token caching. The system also features schema specialization for MCP calls, adapting to model size and retrieval settings. Foundry IQ knowledge bases demonstrate superior cost-recall tradeoffs, operating on the efficiency frontier, and consistently improve delegation accuracy and answer quality across diverse datasets. For instance, they lower the no-answer rate by 94.5% and increase evidence recall by 37.9% compared to BM25 search. The functionality is available in the latest preview API.

Key takeaway

For AI Engineers optimizing retrieval-augmented generation (RAG) systems, Foundry IQ knowledge bases present a compelling alternative to standalone search tools. You should consider integrating these knowledge bases to achieve up to 54% higher evidence recall and 34% token cost savings. This allows you to deploy more responsive, accurate agents by leveraging dynamic retrieval, enhanced semantic ranking, and specialized schema adaptation, ultimately improving answer quality and reducing no-answer rates in enterprise applications.

Key insights

Foundry IQ knowledge bases significantly improve agentic retrieval's evidence recall and answer quality while reducing token costs.

Principles

Dynamic agentic retrieval loops enhance query customization and follow-up.
Schema specialization optimizes agent guidance for varying model sizes.
Ground-truth nugget metrics provide robust RAG evaluation.

Method

Knowledge bases orchestrate agentic retrieval via dynamic loops, custom queries, content review, and follow-up queries, synthesizing grounded answers.

In practice

Deploy knowledge bases to combine diverse structured and unstructured data sources.
Tune knowledge base configurations (e.g., gpt-5.4-mini) for specific cost-recall balance.

Topics

Foundry IQ
Knowledge Bases
Agentic Retrieval
RAG Performance
LLM Cost Optimization
Evidence Recall

Best for: Machine Learning Engineer, NLP Engineer, AI Engineer, MLOps Engineer, AI Architect

Related on AIssential

See Counsel's argued verdicts on the open AI decisions leaders are weighing →

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.