Foundry IQ: Improve recall by up to 54% with knowledge bases
Summary
Foundry IQ knowledge bases significantly enhance agentic retrieval performance, improving evidence recall by up to 54% and reducing token costs by 34% compared to standalone retrieval tools. Evaluated on the BrowseComp-Plus benchmark, replacing single-shot RAG with a knowledge base boosts evidence recall by up to 46%. These improvements stem from a dynamic agentic retrieval loop, an enhanced semantic ranker, better answer synthesis, and efficient token caching. The system also features schema specialization for MCP calls, adapting to model size and retrieval settings. Foundry IQ knowledge bases demonstrate superior cost-recall tradeoffs, operating on the efficiency frontier, and consistently improve delegation accuracy and answer quality across diverse datasets. For instance, they lower the no-answer rate by 94.5% and increase evidence recall by 37.9% compared to BM25 search. The functionality is available in the latest preview API.
Key takeaway
For AI Engineers optimizing retrieval-augmented generation (RAG) systems, Foundry IQ knowledge bases present a compelling alternative to standalone search tools. You should consider integrating these knowledge bases to achieve up to 54% higher evidence recall and 34% token cost savings. This allows you to deploy more responsive, accurate agents by leveraging dynamic retrieval, enhanced semantic ranking, and specialized schema adaptation, ultimately improving answer quality and reducing no-answer rates in enterprise applications.
Key insights
Foundry IQ knowledge bases significantly improve agentic retrieval's evidence recall and answer quality while reducing token costs.
Principles
- Dynamic agentic retrieval loops enhance query customization and follow-up.
- Schema specialization optimizes agent guidance for varying model sizes.
- Ground-truth nugget metrics provide robust RAG evaluation.
Method
Knowledge bases orchestrate agentic retrieval via dynamic loops, custom queries, content review, and follow-up queries, synthesizing grounded answers.
In practice
- Deploy knowledge bases to combine diverse structured and unstructured data sources.
- Tune knowledge base configurations (e.g., gpt-5.4-mini) for specific cost-recall balance.
Topics
- Foundry IQ
- Knowledge Bases
- Agentic Retrieval
- RAG Performance
- LLM Cost Optimization
- Evidence Recall
Best for: Machine Learning Engineer, NLP Engineer, AI Engineer, MLOps Engineer, AI Architect
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Microsoft Foundry Blog articles.