Thinking to recall: How reasoning unlocks parametric knowledge in LLMs
Summary
Google Research scientists Zorik Gekhman and Jonathan Herzig, in their June 24, 2026 study "Thinking to Recall," reveal that reasoning significantly enhances Large Language Model (LLM) recall of simple facts, even without complex logical steps. Investigating Gemini-2.5 (Flash and Pro) and Qwen3-32B models on SimpleQA Verified and EntityQuestions datasets, they found two mechanisms. First, a "computational buffer effect" where generating extra tokens, even meaningless ones like "Let me think," provides latent processing time, improving recall. Second, "factual priming" involves models generating related facts, which semantically primes the network for the correct answer, akin to human spreading activation. However, a critical risk is the "hallucination trap": a single hallucinated intermediate fact in a reasoning trace significantly reduces the likelihood of a correct final answer. The study suggests improving reliability by selecting reasoning trajectories with verifiable, hallucination-free facts.
Key takeaway
For AI Scientists and NLP Engineers developing LLMs, understanding that reasoning enhances parametric knowledge recall, even for simple facts, is crucial. You should prioritize models and training strategies that encourage verifiable, hallucination-free intermediate reasoning steps. This mitigates the "hallucination trap" where incorrect intermediate facts degrade final answer accuracy. Improving overall model reliability and factual accuracy in closed-book QA tasks is key.
Key insights
Reasoning in LLMs unlocks parametric knowledge through latent computation and factual priming, even for simple facts.
Principles
- Generating tokens provides a computational buffer for latent processing.
- Related facts prime correct answer recall via generative self-retrieval.
- Hallucinated intermediate facts significantly degrade final answer accuracy.
Method
The study used the pass@k metric on Gemini-2.5 (Flash and Pro) and Qwen3-32B models with SimpleQA Verified and EntityQuestions datasets, comparing reasoning ON vs. OFF, and testing dummy traces and extracted facts.
In practice
- Prioritize reasoning trajectories with verifiable, hallucination-free facts.
- Implement process rewards for factually supported intermediate steps.
Topics
- Large Language Models
- Parametric Knowledge
- Chain-of-Thought Reasoning
- Factual Priming
- Hallucination Detection
- Gemini-2.5
Best for: Research Scientist, AI Engineer, Machine Learning Engineer, AI Scientist, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by The latest research from Google.