How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning

· Source: cs.CL updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Expert, extended

Summary

Researchers from Nanjing University, Alibaba Group, and Ant Group analyzed how "thinking LLMs" like DeepSeek-R1 and Qwen3-4B-Thinking read their own reasoning traces to produce answers, focusing on quantitative reasoning tasks. They identified a "benign self-reading pattern" in correct solutions, characterized by a forward shift of attention focus along the reasoning trace and persistent concentration on key semantic anchors. Incorrect solutions, conversely, exhibited diffuse and irregular attention. Interpreting this benign pattern as a sign of internal certainty, the team developed a training-free steering method using Self-Reading Quality (SRQ) scores. SRQ combines geometric metrics for process control and semantic metrics for content monitoring to guide LLMs toward this beneficial self-reading behavior. Experiments on GSM8K, MATH500, and SVAMP benchmarks showed consistent accuracy gains, up to 2.6% over base LLMs and 6.6% on AIME24–25, confirming the link between self-reading quality and answer correctness.

Key takeaway

For AI Engineers and Research Scientists developing or deploying reasoning LLMs, understanding and steering self-reading patterns is crucial. Your models' internal attention dynamics during answer generation directly impact correctness. By implementing SRQ-driven steering, you can guide LLMs toward more certain and grounded internal states, potentially boosting accuracy on quantitative tasks like math word problems and complex scientific questions, even with noisy reasoning traces. Consider integrating SRQ metrics into your model evaluation and fine-tuning pipelines to enhance reliability.

Key insights

Benign self-reading patterns in LLMs, marked by focused, forward-shifting attention, correlate with correct quantitative reasoning.

Principles

Method

The Self-Reading Quality (SRQ) steering method quantifies attention patterns using geometric and semantic metrics to construct steering vectors, guiding LLMs toward benign self-reading during inference.

In practice

Topics

Best for: AI Engineer, Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.CL updates on arXiv.org.