Graphs of Research: Citation Evolution Graphs as Supervision for Research Idea Generation
Summary
Graphs of Research (GoR) is a supervised fine-tuning (SFT) method that enhances large language models (LLMs) for automated research idea generation by incorporating citation-evolution graphs. Unlike existing methods that treat references as flat text, GoR extracts a 2-hop reference neighborhood for each seed paper, derives structural relations (citation position, frequency, predecessor links, publication time), and organizes them into a paper-evolution directed acyclic graph (DAG). An automated pipeline extracts data from five major ML/NLP venues, creating a dataset of 498 train, 50 validation, and 50 test seed papers with approximately 7,600 cited references. Qwen2.5-7B-Instruct-1M is fine-tuned on a structured-text prompt that includes the citation graph, edge signals, and reference information. GoR-SFT achieved state-of-the-art performance in head-to-head LLM-judge tournaments against gpt-4o-driven baselines, demonstrating the effectiveness of citation-evolution graphs as a supervision signal.
Key takeaway
For research scientists developing automated ideation systems, you should consider integrating citation-evolution graphs as a supervised fine-tuning signal. This approach, exemplified by GoR-SFT, enables smaller 7B models to outperform larger LLMs like gpt-4o on idea quality and cost-efficiency by providing structured relational context, particularly enhancing significance and clarity in generated ideas. Explore building graph-aware datasets and fine-tuning open-source LLMs to accelerate scientific innovation.
Key insights
Citation-evolution graphs, when used as an SFT signal, significantly improve LLM-based research idea generation.
Principles
- Structural relations among references are crucial for idea generation.
- Supervised fine-tuning internalizes graph signals more effectively than zero-shot prompting.
Method
Extract a 2-hop reference neighborhood, derive relations from citation context, organize into a paper-evolution DAG, serialize as structured text, then fine-tune an LLM (e.g., Qwen2.5-7B-Instruct-1M) to predict a five-field idea.
In practice
- Use 2-hop reference neighborhoods for comprehensive context.
- Annotate edges with features like role, influence, recency, and topology.
- Fine-tune smaller LLMs with graph data for cost-effective SOTA.
Topics
- Research Idea Generation
- LLM-based Idea Generation
- Citation Evolution Graphs
- Supervised Fine-tuning
- Qwen2.5-7B-Instruct-1M
Code references
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.