Interpretable Traces, Unexpected Outcomes: Investigating the Disconnect in Trace-Based Knowledge Distillation

2026-04-21 · Source: cs.AI updates on arXiv.org · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics, Natural Language Processing · Depth: Expert, extended

Summary

A study by Siddhant Bhambri, Upasana Biswas, and Subbarao Kambhampati from Arizona State University investigates the effectiveness of using intermediate "reasoning" traces in Knowledge Distillation (KD) for Small Language Models (SLMs) in Question Answering (QA) tasks. The researchers employed a rule-based problem decomposition method for Open Book QA, breaking down complex queries into verifiable Classification and Information Retrieval steps. This approach allowed for the generation of interpretable traces whose correctness could be objectively evaluated. Supervised Fine-Tuning (SFT) experiments were conducted on Llama-3.2-1B-Instruct and Qwen3-1.7B chat models using three datasets: CoTemp QA, Microsoft MARCO QA, and Facebook bAbI QA. The surprising finding was that correct intermediate traces do not guarantee a correct final solution, and conversely, correct final solutions often arise from incorrect traces, challenging the implicit assumption that reasoning traces improve SLM performance via KD.

Key takeaway

For Research Scientists developing or deploying SLMs for QA, you should critically re-evaluate the utility of intermediate reasoning traces in Knowledge Distillation. The findings indicate that focusing solely on trace correctness during SFT may not translate to improved final solution accuracy, and models can achieve correct answers through unfaithful reasoning paths. Prioritize end-to-end solution accuracy and consider alternative distillation methods if interpretability is a key requirement, as current trace-based methods may foster a false sense of trust.

Key insights

Intermediate reasoning traces in SLM knowledge distillation do not correlate with final solution accuracy.

Principles

Trace correctness does not guarantee solution correctness.
Solution correctness does not imply correct intermediate traces.

Method

A rule-based problem decomposition method for Open Book QA breaks problems into Classification and Information Retrieval steps, enabling verifiable intermediate trace generation for SLM fine-tuning.

In practice

Decompose complex QA into verifiable sub-problems.
Evaluate trace accuracy independently from final solution accuracy.

Topics

Knowledge Distillation
Small Language Models
Reasoning Traces
Open Book QA
Problem Decomposition

Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by cs.AI updates on arXiv.org.