How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning

2026-04-22 · Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

Researchers from Nanjing University, Alibaba Group, and Ant Group analyzed how "thinking LLMs" like DeepSeek-R1 and Qwen3-4B-Thinking read their own reasoning traces to produce answers, focusing on quantitative reasoning tasks. They identified a "benign self-reading pattern" in correct solutions, characterized by a forward shift of attention focus along the reasoning trace and persistent concentration on key semantic anchors. Incorrect solutions, conversely, exhibited diffuse and irregular attention. Interpreting this benign pattern as a sign of internal certainty, the team developed a training-free steering method using Self-Reading Quality (SRQ) scores. SRQ combines geometric metrics for process control and semantic metrics for content monitoring to guide LLMs toward this beneficial self-reading behavior. Experiments on GSM8K, MATH500, and SVAMP benchmarks showed consistent accuracy gains, up to 2.6% over base LLMs and 6.6% on AIME24–25, confirming the link between self-reading quality and answer correctness.

Key takeaway

For AI Engineers and Research Scientists developing or deploying reasoning LLMs, understanding and steering self-reading patterns is crucial. Your models' internal attention dynamics during answer generation directly impact correctness. By implementing SRQ-driven steering, you can guide LLMs toward more certain and grounded internal states, potentially boosting accuracy on quantitative tasks like math word problems and complex scientific questions, even with noisy reasoning traces. Consider integrating SRQ metrics into your model evaluation and fine-tuning pipelines to enhance reliability.

Key insights

Benign self-reading patterns in LLMs, marked by focused, forward-shifting attention, correlate with correct quantitative reasoning.

Principles

Answer tokens act as a meta-level operation, reading reasoning traces.
Structured attention reflects internal certainty and cognitive control.
Disorganized attention signals cognitive uncertainty and potential errors.

Method

The Self-Reading Quality (SRQ) steering method quantifies attention patterns using geometric and semantic metrics to construct steering vectors, guiding LLMs toward benign self-reading during inference.

In practice

Analyze attention centroids for forward-shifting patterns.
Identify semantic anchors in reasoning traces.
Apply SRQ-driven steering to improve quantitative reasoning accuracy.

Topics

Thinking LLMs
Quantitative Reasoning
Self-Reading Patterns
Attention Analysis
Activation Steering

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.