CRAFT: A Unified Counterfactual Reasoning Framework for Tabular Question Answering and Fact Verification
Summary
CRAFT, a unified Counterfactual Reasoning Framework, addresses the challenges large language models (LLMs) face in multi-step inference over long, structured tables for tasks like tabular question answering and fact verification. Existing methods often rely on single-direction reasoning, limiting hypothesis exploration. CRAFT reformulates these tasks into a bidirectional verification process by explicitly constructing both declarative statements and their counterfactual variants. It extracts evidence from reasoning along both original and counterfactual paths, integrating it via a weighted mechanism to derive final answers. Experimental results demonstrate CRAFT consistently surpasses representative baselines on datasets such as WikiTQ and TabFact, showing significant improvements, particularly for complex question answering. The framework also substantially mitigates performance disparities among different backbone LLMs, indicating counterfactual reasoning enhances LLM discernment in structured reasoning.
Key takeaway
For Machine Learning Engineers developing LLM solutions for complex tabular data tasks, CRAFT's counterfactual reasoning framework offers a robust approach. You should consider implementing bidirectional verification, explicitly generating counterfactual statements alongside original claims, to enhance reasoning and mitigate performance gaps. This method significantly improves accuracy on multi-step inference over structured tables, particularly for complex question answering and fact verification.
Key insights
CRAFT uses bidirectional counterfactual reasoning to improve LLM performance on complex tabular data tasks.
Principles
- Bidirectional verification enhances reasoning.
- Counterfactuals improve hypothesis exploration.
- Weighted evidence integration refines answers.
Method
Reformulate tabular tasks into bidirectional verification. Construct declarative and counterfactual statements. Extract evidence from both paths, then integrate via a weighted mechanism.
In practice
- Apply to tabular Q&A.
- Use for fact verification.
- Improve complex table reasoning.
Topics
- Large Language Models
- Counterfactual Reasoning
- Tabular Question Answering
- Fact Verification
- Bidirectional Verification
- WikiTQ
- TabFact
Best for: Research Scientist, AI Scientist, Machine Learning Engineer, NLP Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.