Knowledge Beyond Language: Bridging the Gap in Multilingual Machine Unlearning Evaluation

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Cybersecurity & Data Privacy · Depth: Expert, extended

Summary

A new study introduces two metrics, Knowledge Separability Score (KSS) and Knowledge Persistence Score (KPS), to evaluate Multilingual Machine Unlearning (MMU) in Large Language Models (LLMs). Existing MMU evaluations, which are often direct extensions of per-language protocols, fail to adequately capture the cross-linguistic distribution of sensitive information. The researchers developed a multilingual parallel QA dataset of 3,800 instances across 10 languages, including high- and low-resource languages, to simulate direct memorization and indirect cross-linguistic spread. KSS measures overall unlearning quality across multiple languages, while KPS assesses consistent information removal between different language pairs. Experiments on Llama3.1-8B-Instruct and Qwen3-4B-Instruct using various unlearning methods, including Gradient Ascent (GA), Gradient Ascent with Gradient Descent term (GAGDR), Gradient Ascent with KL minimization (GAKLR), Negative Preference Optimization (NPO), and pruning, revealed that unlearning is more challenging in hold-out languages and performance degrades as the forget ratio increases.

Key takeaway

For NLP engineers developing or deploying multilingual LLMs, current unlearning evaluation methods are insufficient for ensuring privacy across languages. You should integrate the Knowledge Separability Score (KSS) and Knowledge Persistence Score (KPS) into your evaluation pipelines to accurately assess whether sensitive information has been truly removed across all relevant languages, including those not directly used in training. This will help mitigate privacy risks from cross-linguistic knowledge persistence.

Key insights

New metrics improve Multilingual Machine Unlearning evaluation by accounting for cross-linguistic knowledge spread.

Principles

Method

The study generated a 10-language, 3,800-instance multilingual parallel QA dataset from synthetic profiles, then applied KSS and KPS to evaluate unlearning methods on Llama3.1-8B-Instruct and Qwen3-4B-Instruct.

In practice

Topics

Code references

Best for: NLP Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, AI Security Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.