Differences in Typological Alignment in Language Models' Treatment of Differential Argument Marking

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

GPT-2 models trained on synthetic corpora demonstrate human-like typological preferences for natural markedness direction in differential argument marking (DAM) systems. Researchers extended a controlled synthetic learning method to DAM, a semantic licensing system where morphological marking depends on semantic prominence. They trained GPT-2 models on 18 distinct DAM systems and evaluated their generalization using minimal pairs. The models consistently favored systems where overt marking targets semantically atypical arguments, aligning with human language regularities. However, the models did not reproduce the strong object preference observed in human languages, where DAM more frequently targets objects over subjects. This dissociation suggests that various typological tendencies in language models may stem from different underlying mechanisms.

Key takeaway

For research scientists investigating language model typological alignment, you should note that current models replicate some human linguistic preferences, like natural markedness direction, but not others, such as the strong object preference in differential argument marking. This implies that your research into the origins of typological tendencies in LMs may need to explore distinct underlying sources for different linguistic phenomena.

Key insights

LMs show human-like markedness preferences but not object preference in differential argument marking.

Principles

Method

GPT-2 models were trained on 18 synthetic corpora implementing distinct differential argument marking (DAM) systems, then evaluated for generalization using minimal pairs.

In practice

Topics

Best for: Research Scientist, AI Researcher, AI Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.