DiscoTrace: Representing and Comparing Answering Strategies of Humans and LLMs in Information-Seeking Question Answering
Summary
DiscoTrace is a new method introduced to identify and compare the rhetorical strategies used by humans and Large Language Models (LLMs) when answering information-seeking questions. The method represents answers as sequences of question-related discourse acts, interpreted based on the original question and annotated using rhetorical structure theory parses. Applying DiscoTrace to answers from nine diverse human communities revealed varied preferences in answer construction strategies. In contrast, LLMs consistently lacked rhetorical diversity in their responses, even when explicitly prompted to emulate specific human community guidelines. Furthermore, LLMs demonstrated a systematic tendency towards breadth, addressing question interpretations that human answerers typically chose to omit. These findings are intended to inform the development of more pragmatically sophisticated LLM answerers that can adapt their strategies based on context in question answering.
Key takeaway
For AI Product Managers developing LLM-powered QA systems, you should recognize that current LLMs do not naturally replicate the nuanced rhetorical diversity of human responses. Your development efforts should focus on integrating mechanisms that allow LLMs to adopt context-specific answering strategies, moving beyond their default tendency for broad, undifferentiated responses. This will enhance the pragmatic quality and user satisfaction of your QA applications.
Key insights
LLMs lack rhetorical diversity and systematically over-address question interpretations compared to human answerers.
Principles
- Human communities exhibit diverse answering strategies.
- LLMs do not naturally mimic human rhetorical diversity.
Method
DiscoTrace represents answers as sequences of question-related discourse acts, interpreted against the original question, and annotated on rhetorical structure theory parses.
In practice
- Analyze LLM outputs for rhetorical breadth.
- Develop LLMs with context-aware answering strategies.
Topics
- DiscoTrace
- Rhetorical Strategies
- Information-Seeking QA
- LLM Answering Strategies
- Human-LLM Comparison
Code references
Best for: Research Scientist, AI Product Manager, AI Scientist, NLP Engineer, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.