Soft Token Alignment for Cross-Lingual Reasoning

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

Multilingual large language models often exhibit inconsistent reasoning across languages for semantically equivalent prompts, a problem stemming from language-specific lexical choices that cause reasoning paths to diverge. To address this, a new auxiliary objective called SOLAR is proposed for supervised fine-tuning. SOLAR aligns "soft-token representations" across languages, using English as a pivot. Soft tokens are continuous, probability-weighted mixtures of vocabulary embeddings, designed to aggregate information from semantically related tokens. By aligning non-English soft-token summaries to their English counterparts in a shared embedding space, SOLAR significantly improves accuracy. It achieves up to +17.7 points over base models and +3.8 points over standard supervised fine-tuning on four multilingual reasoning benchmarks, with the most substantial gains observed in low-resource languages. This approach also enhances final-layer cross-lingual similarity and reduces language-cluster separability, indicating better preservation of shared semantic structure.

Key takeaway

For Machine Learning Engineers developing multilingual LLMs, consider integrating soft-token alignment techniques like SOLAR. This approach can significantly improve reasoning consistency and accuracy, particularly for low-resource languages, by preserving shared semantic structure across languages. Your models will exhibit stronger cross-lingual similarity and reduced language-cluster separability, leading to more reliable performance on diverse linguistic inputs. Explore fine-tuning with auxiliary objectives that explicitly align continuous representations.

Key insights

SOLAR aligns soft-token representations across languages using English as a pivot to enhance multilingual LLM reasoning consistency and accuracy.

Principles

Method

SOLAR, an auxiliary objective for supervised fine-tuning, aligns non-English soft-token summaries to their English counterparts in a shared embedding space, using English as a pivot.

In practice

Topics

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.