Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation
Summary
A study by Siddhant Bhambri, Upasana Biswas, and Subbarao Kambhampati from Arizona State University investigates the effectiveness of using intermediate "reasoning" traces in Knowledge Distillation (KD) for Small Language Models (SLMs) in Question Answering (QA) tasks. The researchers employed a rule-based problem decomposition method for Open Book QA, breaking down complex queries into verifiable Classification and Information Retrieval steps. This approach allowed for the generation of interpretable traces whose correctness could be objectively evaluated. Supervised Fine-Tuning (SFT) experiments were conducted on Llama-3.2-1B-Instruct and Qwen3-1.7B chat models using three datasets: CoTemp QA, Microsoft MARCO QA, and Facebook bAbI QA. The surprising finding was that correct intermediate traces do not guarantee a correct final solution, and conversely, correct final solutions often arise from incorrect traces, challenging the implicit assumption that reasoning traces improve SLM performance via KD.
Key takeaway
For Research Scientists developing or deploying SLMs for QA, you should critically re-evaluate the utility of intermediate reasoning traces in Knowledge Distillation. The findings indicate that focusing solely on trace correctness during SFT may not translate to improved final solution accuracy, and models can achieve correct answers through unfaithful reasoning paths. Prioritize end-to-end solution accuracy and consider alternative distillation methods if interpretability is a key requirement, as current trace-based methods may foster a false sense of trust.
Key insights
Intermediate reasoning traces in SLM knowledge distillation do not correlate with final solution accuracy.
Principles
- Trace correctness does not guarantee solution correctness.
- Solution correctness does not imply correct intermediate traces.
Method
A rule-based problem decomposition method for Open Book QA breaks problems into Classification and Information Retrieval steps, enabling verifiable intermediate trace generation for SLM fine-tuning.
In practice
- Decompose complex QA into verifiable sub-problems.
- Evaluate trace accuracy independently from final solution accuracy.
Topics
- Knowledge Distillation
- Small Language Models
- Reasoning Traces
- Open Book QA
- Problem Decomposition
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.