AISysRev -- LLM-based Tool for Title-abstract Screening
Summary
AiSysRev is a new LLM-based web application designed to accelerate the title-abstract screening phase of systematic reviews, a notoriously laborious task in software engineering that can take 67 weeks on average. Implemented as a Docker container, the tool accepts CSV files of paper titles and abstracts, allowing users to define inclusion/exclusion criteria. It supports multiple LLMs via OpenRouter, offering both zero-shot and few-shot screening capabilities. AiSysRev also integrates manual screening, displaying LLM outputs as guidance for human reviewers, and allows exporting results to CSV for further analysis. A trial study with 137 papers identified four classification categories: Easy Includes, Easy Excludes, Boundary Includes, and Boundary Excludes, highlighting that human intervention remains crucial for "Boundary" cases where LLMs are prone to errors. The tool aims to reduce the burden of assessing large volumes of scientific literature, particularly for rapid reviews.
Key takeaway
For AI Engineers and Research Scientists conducting systematic literature reviews, AiSysRev offers a practical approach to significantly reduce screening effort. You should integrate this Dockerized tool to automate initial title-abstract screening, especially for high-volume or rapid reviews. While LLMs handle clear inclusions/exclusions efficiently, always plan for human review of "Boundary" cases, leveraging the tool's probability scores to focus your efforts where LLMs are most likely to err, ensuring accuracy without sacrificing speed.
Key insights
LLMs can significantly expedite systematic review screening, but human oversight remains critical for ambiguous cases.
Principles
- LLMs excel at clear-cut screening decisions.
- Human judgment is essential for "Boundary" cases.
- Few-shot prompting improves LLM screening performance.
Method
AiSysRev processes CSV input with titles/abstracts, applies user-defined criteria via multiple LLMs (zero-shot/few-shot), and provides LLM outputs for guided manual review, exporting results to CSV.
In practice
- Use multiple LLMs for screening to compare results.
- Prioritize manual review for papers with uncertain LLM scores.
- Apply cutoff rules in rapid reviews for efficiency.
Topics
- AISysRev Tool
- LLM-based Screening
- Systematic Literature Reviews
- Title-Abstract Screening
- Software Engineering Research
Code references
Best for: Research Scientist, AI Scientist, AI Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.