CorPipe at CRAC 2026: Empty Nodes and Cross-Lingual Transfer in Multilingual Coreference Resolution
Summary
CorPipe 26, the winning submission to the CRAC 2026 Shared Task on Multilingual Coreference Resolution, introduces an improved system that predicts empty nodes alongside mentions and coreference links within a single model. This fifth edition of the task focused on comparing generative LLMs and specialized systems, incorporating 5 new datasets and 2 new languages. CorPipe 26 significantly outperformed all other submissions, achieving a 2.8 percentage point lead in the LLM track and a 9.5 percentage point lead in the unconstrained track. The authors also conducted ablation experiments on model sizes, empty node prediction methods, and cross-lingual zero-shot evaluation. The source code and trained models are publicly available at https://github.com/ufal/crac2026-corpipe.
Key takeaway
For NLP Engineers or AI Scientists focused on advancing multilingual coreference resolution, CorPipe 26 demonstrates a critical architectural improvement. Its integrated approach to predicting empty nodes, mentions, and coreference links within a single model significantly outperforms both generative LLMs and other specialized systems. You should consider adopting this unified empty node prediction strategy or leveraging the publicly available CorPipe 26 models and source code to enhance your own multilingual NLP pipelines.
Key insights
CorPipe 26 integrates empty node prediction with mention and coreference resolution for improved multilingual performance.
Principles
- Integrating empty node prediction enhances coreference resolution.
- Specialized systems can outperform generative LLMs.
- Cross-lingual transfer is crucial for multilingual NLP.
Method
CorPipe 26 employs a single model to predict empty nodes, mentions, and coreference links, building upon CorPipe 25's architecture for multilingual coreference.
In practice
- Explore empty node prediction for coreference tasks.
- Evaluate specialized models against LLMs for specific NLP.
- Utilize CorPipe 26's public code for multilingual coreference.
Topics
- Multilingual Coreference Resolution
- Empty Node Prediction
- Cross-Lingual Transfer
- Generative LLMs
- Specialized NLP Systems
- CRAC Shared Task
Code references
Best for: AI Engineer, Research Scientist, AI Scientist, NLP Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.