EGTR-Review: Efficient Evidence-Grounded Scientific Peer Review Generation via Multi-Agent Teacher Distillation
Summary
EGTR-Review is an Evidence-Grounded and Traceable Review Generation framework that enhances scientific peer review generation by addressing limitations of existing Large Language Model (LLM)-based methods, such as generic comments, insufficient evidence, and high inference costs from complex multi-agent systems. This framework first employs a multi-agent teacher to perform structure-aware paper decomposition, key-element extraction, external scholarly evidence retrieval, evidence-state labeling, verification reasoning, and review synthesis. Subsequently, it distills both intermediate reasoning trajectories and final review comments into a lightweight student model using task-prefix-driven multi-task learning. An evidence-weighted objective further refines supervision. Experiments on public datasets demonstrate EGTR-Review (Student) surpasses strong prompt-based, fine-tuned, and structured/agentic baselines across automatic metrics, LLM-as-Judge evaluation, and human evaluation, achieving strong factual grounding and source traceability with substantially lower token consumption and inference time.
Key takeaway
For Machine Learning Engineers developing LLM-based scientific review systems, you should consider adopting multi-agent teacher distillation to overcome issues of generic comments and high inference costs. This approach allows you to train lightweight student models that deliver evidence-grounded, traceable reviews with significantly lower token consumption and faster inference times, improving both quality and operational efficiency. Explore the provided GitHub resources to implement similar distillation strategies.
Key insights
Multi-agent teacher distillation enables efficient, evidence-grounded, and traceable scientific peer review generation by training a lightweight student model.
Principles
- Evidence grounding and traceability improve review quality.
- Distilling multi-agent reasoning enhances efficiency.
- Task-prefix-driven learning boosts student model performance.
Method
EGTR-Review constructs a multi-agent teacher for paper decomposition, evidence retrieval, and review synthesis, then distills its reasoning and comments into a student model via task-prefix-driven multi-task learning and an evidence-weighted objective.
In practice
- Utilize multi-agent systems for complex reasoning tasks.
- Apply teacher distillation to create efficient LLM agents.
- Implement evidence-weighted objectives for robust training.
Topics
- Scientific Peer Review
- Multi-Agent Systems
- Teacher Distillation
- Large Language Models
- Evidence Grounding
- Natural Language Processing
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.