Knowledge Beyond Language: Bridging the Gap in Multilingual Machine Unlearning Evaluation
Summary
A new study introduces two metrics, Knowledge Separability Score (KSS) and Knowledge Persistence Score (KPS), to evaluate Multilingual Machine Unlearning (MMU) in Large Language Models (LLMs). Existing MMU evaluations, which are often direct extensions of per-language protocols, fail to adequately capture the cross-linguistic distribution of sensitive information. The researchers developed a multilingual parallel QA dataset of 3,800 instances across 10 languages, including high- and low-resource languages, to simulate direct memorization and indirect cross-linguistic spread. KSS measures overall unlearning quality across multiple languages, while KPS assesses consistent information removal between different language pairs. Experiments on Llama3.1-8B-Instruct and Qwen3-4B-Instruct using various unlearning methods, including Gradient Ascent (GA), Gradient Ascent with Gradient Descent term (GAGDR), Gradient Ascent with KL minimization (GAKLR), Negative Preference Optimization (NPO), and pruning, revealed that unlearning is more challenging in hold-out languages and performance degrades as the forget ratio increases.
Key takeaway
For NLP engineers developing or deploying multilingual LLMs, current unlearning evaluation methods are insufficient for ensuring privacy across languages. You should integrate the Knowledge Separability Score (KSS) and Knowledge Persistence Score (KPS) into your evaluation pipelines to accurately assess whether sensitive information has been truly removed across all relevant languages, including those not directly used in training. This will help mitigate privacy risks from cross-linguistic knowledge persistence.
Key insights
New metrics improve Multilingual Machine Unlearning evaluation by accounting for cross-linguistic knowledge spread.
Principles
- Knowledge in MMU is distributed across languages.
- Cross-linguistic spread of knowledge can persist post-unlearning.
- Unlearning performance degrades with increased forget ratio.
Method
The study generated a 10-language, 3,800-instance multilingual parallel QA dataset from synthetic profiles, then applied KSS and KPS to evaluate unlearning methods on Llama3.1-8B-Instruct and Qwen3-4B-Instruct.
In practice
- Use KSS-ROC and KSS-PR for comprehensive unlearning quality.
- Employ KPS to quantify cross-lingual knowledge persistence.
- Consider hold-out languages for robust MMU evaluation.
Topics
- Multilingual Machine Unlearning
- Knowledge Separability Score
- Knowledge Persistence Score
- LLM Privacy Risks
- Cross-linguistic Knowledge Spread
Code references
Best for: NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.