Cross-lingual Relation Extraction with Large Language Models: Zero-Shot, Few-Shot, and Fine-Tuned Evaluation on Romanian

· Source: Artificial Intelligence · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Natural Language Processing · Depth: Expert, quick

Summary

A study investigated cross-lingual relation extraction (RE) for Romanian, a low-resource language, by combining automatic dataset translation with large language model (LLM) inference. Researchers translated the SemEval-2010 Task 8 benchmark from English to Romanian using an LLM-based pipeline. They evaluated Gemma 4 31B in zero-shot, few-shot, and QLoRA fine-tuned configurations, alongside four encoder baselines ranging from 125M to 560M parameters, including XLM-RoBERTa and Romanian BERT. Results showed Romanian incurred a 3 to 5 percentage point (pp) drop relative to English in prompt-only settings, with few-shot prompting offering marginal gains. QLoRA fine-tuning significantly improved macro F1-Score by over 22 pp in both languages and reduced the cross-lingual gap from 3.3 to 1.4 pp. Notably, smaller encoder baselines performed within 1-4 pp of QLoRA Gemma on Romanian, suggesting a weak case for large 31B models in compute-sensitive deployment scenarios for single-task RE. The translated dataset, evaluation code, and trained models are released.

Key takeaway

For Machine Learning Engineers developing relation extraction systems for low-resource languages, you should prioritize QLoRA fine-tuning over zero-shot or few-shot prompting. While QLoRA significantly improves macro F1-Score by over 22 percentage points and reduces cross-lingual performance gaps, consider that smaller encoder models can achieve comparable results within 1-4 percentage points of a 31B LLM. This suggests optimizing for compute efficiency by evaluating smaller, specialized models for deployment.

Key insights

QLoRA fine-tuning significantly boosts cross-lingual relation extraction performance for low-resource languages, outperforming prompt-only LLMs.

Principles

Method

Cross-lingual RE for Romanian was achieved by translating the SemEval-2010 Task 8 benchmark from English using an LLM-based pipeline, then evaluating Gemma 4 31B and encoder baselines.

In practice

Topics

Best for: Research Scientist, AI Engineer, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.