SafeLLM: Extraction as a Hallucination-Resistant Alternative to Rewriting in Safety-Critical Settings
Summary
SafeLLM introduces an extraction-based approach as a hallucination-resistant alternative to rewriting in retrieval-augmented generation (RAG) systems, particularly for safety- and compliance-critical applications accessing organizational documentation like SOPs and HR policies. The research evaluates various prompting strategies, including line-number-based source selection, explicit safety-annotated sentence extraction, and a multi-stage refinement pipeline. Experiments were conducted using local NHS acute care, oncology, and UK-wide NICE guidelines, employing both frontier-scale and locally deployable large language models. Results indicate that line-number selection yields the strongest performance, surpassing direct copying and safety-focused methods across different model scales, maintaining up to 95% term recall and close alignment with source text. While safety-oriented approaches enhance precision, they also introduce systematic omissions, a trade-off further amplified by multi-stage filtering. Performance is influenced by document structure, with line-based extraction excelling in protocol-like content and other strategies performing better on verbose documents, achieving up to 97% term recall.
Key takeaway
For Machine Learning Engineers developing RAG systems for safety-critical applications, prioritize extraction-based methods over free-form rewriting to mitigate hallucination risks. Specifically, implement line-number-based source selection, as it demonstrated superior performance and up to 95% term recall, especially with protocol-like content. Be aware that overly safety-oriented filtering or multi-stage refinement can introduce systematic omissions, impacting completeness. Your choice of extraction strategy should also consider the document's verbosity and structure for optimal results.
Key insights
Extraction, especially line-number-based, reduces LLM hallucinations in RAG for safety-critical documents better than rewriting or safety-focused filtering.
Principles
- Line-number selection improves RAG accuracy.
- Rewriting-based RAG introduces hallucinations.
- Safety-focused filtering can cause omissions.
Method
Compares line-number-based source selection, explicit safety-annotated sentence extraction, and a multi-stage pipeline for refining answers using source guidelines.
In practice
- Apply line-number extraction for SOPs.
- Use extraction for compliance guidelines.
- Consider document structure for strategy.
Topics
- Large Language Models
- Retrieval-Augmented Generation
- Hallucination Mitigation
- Information Extraction
- Safety-Critical Systems
- Document Analysis
Best for: AI Architect, AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.