Multilinguality at the Edge: Developing Language Models for the Global South

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Emerging Technologies & Innovation · Depth: Expert, quick

Summary

A new survey examines the "last mile" challenge in deploying language models (LMs) to non-English-speaking and hardware-constrained communities in the Global South. This challenge arises from the competing technical requirements of multilinguality and edge deployment, which are often siloed research areas despite their critical intersection for linguistically diverse communities facing severe infrastructure limitations. The survey analyzes 232 papers across the entire LM pipeline, from data collection to development and deployment, to understand the current state of the art and identify key challenges in combining these two fields. The authors also outline open questions and offer actionable recommendations for various stakeholders within the NLP ecosystem, aiming to foster more inclusive and equitable language technologies.

Key takeaway

For research scientists developing language models, you should prioritize integrating multilinguality with edge deployment considerations from the outset. This approach is crucial for creating LMs that are truly accessible and effective in hardware-constrained, non-English-speaking regions, thereby addressing the "last mile" challenge and fostering equitable technology distribution. Consider the entire LM pipeline, from data to deployment, through this dual lens.

Key insights

Multilinguality and edge deployment intersect at the "last mile" challenge for equitable LM access in the Global South.

Principles

Method

A survey of 232 papers across the LM pipeline (data collection, development, deployment) was conducted to analyze the state of the art and challenges in combining multilinguality and edge deployment.

In practice

Topics

Best for: Research Scientist, AI Scientist, NLP Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.