LLM Reasoning Is Latent, Not the Chain of Thought
Summary
A position paper by Wenshuo Wang from South China University of Technology argues that large language model (LLM) reasoning should be primarily understood as latent-state trajectory formation, rather than explicit surface chain-of-thought (CoT). The paper formalizes three competing hypotheses: H1 (latent-state trajectories mediate reasoning), H2 (explicit surface CoT mediates reasoning), and H0 (generic serial compute explains reasoning gains). Through a reorganization of existing empirical and mechanistic work, and new compute-audited exemplars, the author finds that current evidence most strongly supports H1 as a default working hypothesis, though not as a task-independent verdict. The paper recommends treating latent-state dynamics as the default object of study for LLM reasoning and evaluating reasoning with designs that explicitly disentangle surface traces, latent states, and serial compute.
Key takeaway
For research scientists investigating LLM reasoning, you should shift your default perspective to latent-state dynamics rather than relying solely on surface chain-of-thought. This means designing experiments that explicitly separate and control for surface traces, latent states, and generic serial compute to accurately determine causal leverage and improve interpretability and safety monitoring.
Key insights
LLM reasoning is best understood as latent-state trajectory formation, not explicit chain-of-thought.
Principles
- Latent-state dynamics are the default object of LLM reasoning study.
- Reasoning evaluation must disentangle surface traces, latent states, and serial compute.
Method
A diagnostic experimental design for LLM reasoning must factorize surface traces ($S$), latent-state trajectories ($Z$), and generic serial compute ($B$), use matched controls, and define differential verdict rules.
In practice
- Use compute-audited designs to compare reasoning hypotheses.
- Prioritize latent-state dynamics in LLM reasoning research.
Topics
- LLM Reasoning
- Latent State Dynamics
- Chain of Thought
- Serial Compute
- Reasoning Regimes
Best for: Research Scientist, AI Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.