Do Language Models Track Entities Across State Changes?

2026-05-28 · Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A recent study investigates how transformer language models (LMs) perform entity tracking (ET) in complex natural language scenarios involving multiple state-changing operations like PUT, REMOVE, and MOVE. Researchers found that LMs do not incrementally track world states across tokens or query-relevant states across layers. Instead, LMs aggregate all relevant information in parallel at the final token once the query becomes clear. The investigation further revealed that LMs implement the REMOVE operation using a fragile global suppression tag, a mechanism that predicts various behavioral failure modes confirmed by the study. A mechanistic solution involving nullifying this tag was proposed to partially mitigate the issue. Overall, the findings indicate LMs employ a non-sequential strategy to solve a fundamentally sequential task, demonstrating the interplay between behavioral and mechanistic analyses.

Key takeaway

For NLP Engineers designing systems requiring robust entity tracking, understand that current transformer LMs do not incrementally track state. Your models likely aggregate information in parallel, making them susceptible to specific failure modes, particularly with REMOVE operations due to fragile global suppression tags. You should consider these non-incremental processing characteristics when debugging or designing systems, and explore methods like nullifying suppression tags to improve reliability in state-changing contexts.

Key insights

Language models tackle sequential entity tracking non-incrementally, aggregating information in parallel, and exhibit fragile removal mechanisms.

Principles

LMs solve sequential tasks non-sequentially.
Global suppression tags cause LM removal failures.
Behavioral and mechanistic analyses interact fruitfully.

Method

Researchers investigated LM mechanisms for PUT, REMOVE, and MOVE operations, characterizing non-incremental ET. A mechanistic solution of nullifying the global suppression tag was proposed for REMOVE.

In practice

Nullify the global suppression tag for REMOVE.
Inform mechanistic hypotheses with behavioral results.
Build stronger evaluations from mechanistic insights.

Topics

Language Models
Entity Tracking
Transformer Architectures
Mechanistic Interpretability
State Changes
Failure Modes

Best for: Research Scientist, AI Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.