Didact: A Cross-Domain Capability Discovery System for Defence
Summary
Didact is a prototype cross-domain capability discovery system designed for defence and defence-aligned sectors, developed as an academia-industry project for Australia. It integrates publicly available Australian defence reports and policy documents with a purpose-built knowledge graph derived from Australian research publications. The system provides natural language conversations for policy-oriented workflows, leveraging a composite Retrieval-Augmented Generation (RAG) pipeline. A key feature is its interactive Evidence Rail, which visualizes retrieved evidence and source relationships, enabling users to trace generated summaries back to underlying data. Didact maintains source isolation through separate vector stores for different access levels, such as "public" and "classified" documents, and uses structured graph retrieval for academic metadata. Evaluation shows strong retrieval quality, with a Hit@1 of 0.680 and a mean LLM response latency of 2.17 seconds, demonstrating its utility and adaptability for fragmented knowledge domains.
Key takeaway
For Directors of AI/ML deploying conversational systems in high-stakes, fragmented domains like defence, you should prioritize composite RAG architectures that integrate diverse data sources. Implement interactive evidence rails to ensure auditability and user trust, allowing traceability from generated answers to original sources. Your system must also maintain strict source isolation, partitioning data by access level to meet security requirements. This approach enhances situational awareness and supports evidence-based decision-making.
Key insights
Didact demonstrates a multi-source composite RAG system for cross-domain capability discovery with interactive evidence traceability.
Principles
- Cross-domain RAG requires multi-source synthesis.
- Evidence-grounded synthesis needs source-level control.
- Interactive evidence rails enhance traceability.
Method
Didact uses an Enricher for query priming, then a LangGraph Orchestrator for composite RAG, combining tiered document and structured graph retrieval, finally assembling an interactive Evidence Rail.
In practice
- Integrate public reports with research knowledge graphs.
- Use vector stores for documents, Neo4j for graph data.
Topics
- Retrieval-Augmented Generation
- Knowledge Graphs
- Defence Intelligence
- Cross-Domain Information Retrieval
- Evidence Traceability
- Large Language Models
Code references
Best for: AI Architect, Research Scientist, AI Scientist, Policy Maker, Director of AI/ML
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.