LoRi: Low-Rank Distillation for Implicit Reasoning
Summary
LoRi is a novel low-rank distillation framework designed to enhance implicit chain-of-thought (iCoT) methods in large language models, which typically underperform explicit CoT prompting. The framework is motivated by the empirical finding that hidden-state reasoning trajectories exhibit a low-rank structure. LoRi transfers reasoning by aligning teacher and student trajectories within a shared low-rank tensor subspace, utilizing first- and second-order statistics to capture the global structure of reasoning while supporting a compact latent process. Evaluated across LLaMA and Qwen model families on mathematical reasoning benchmarks, LoRi consistently improves performance, particularly on challenging multi-step tasks, approaching explicit CoT accuracy and surpassing prior iCoT distillation techniques.
Key takeaway
For Machine Learning Engineers developing more efficient reasoning in large language models, LoRi offers a promising distillation approach. You should consider implementing low-rank distillation to align teacher and student reasoning trajectories, especially for multi-step mathematical tasks. This method can significantly boost implicit chain-of-thought performance, nearing explicit CoT accuracy while maintaining a compact latent process, potentially reducing inference costs compared to explicit CoT.
Key insights
Hidden-state reasoning trajectories exhibit low-rank structure, enabling efficient distillation for implicit reasoning.
Principles
- Reasoning trajectories show low-rank structure.
- Align teacher/student trajectories in low-rank subspace.
- First- and second-order statistics capture global structure.
Method
LoRi distills reasoning by aligning teacher and student hidden-state trajectories in a shared low-rank tensor subspace using first- and second-order statistics, capturing global reasoning structure.
In practice
- Improve iCoT performance on multi-step tasks.
- Approach explicit CoT accuracy with distillation.
- Apply to LLaMA and Qwen model families.
Topics
- Low-Rank Distillation
- Implicit Chain-of-Thought
- Large Language Models
- Mathematical Reasoning
- Model Distillation
- LLaMA
- Qwen
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Artificial Intelligence.