Why are all LLMs Obsessed with Japanese Culture? On the Hidden Cultural and Regional Biases of LLMs

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Advanced, quick

Summary

A new study introduces the Culture-Related Open Questions (CROQ) dataset to analyze regional and cultural biases in Large Language Models (LLMs). Contrary to prior research suggesting Western and Anglocentric biases, this work reveals a distinct preference for Japanese culture in LLM responses. The research also indicates that LLMs generate more diverse outputs when prompted in high-resource languages like English, and exhibit less bias towards countries where the input language is official. Furthermore, the study pinpoints the emergence of this cultural bias, finding that it becomes evident after supervised fine-tuning, rather than during the initial pre-training phase of the LLMs.

Key takeaway

For research scientists evaluating LLM fairness and cultural representation, you should specifically investigate biases that emerge post-supervised fine-tuning. Your evaluations must extend beyond Anglocentric biases to detect unexpected regional preferences, such as the observed tendency towards Japanese culture, and consider how input language choice influences output diversity.

Key insights

LLMs exhibit a surprising bias towards Japanese culture, emerging post-supervised fine-tuning.

Principles

Method

The CROQ dataset, a taxonomy of Culture-Related Open Questions, was used to probe LLM cultural preferences and identify bias emergence points.

In practice

Topics

Best for: Research Scientist, AI Scientist, NLP Engineer, AI Ethicist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.