Enhancing Structural Mapping with LLM-derived Abstractions for Analogical Reasoning in Narratives

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning · Depth: Expert, quick

Summary

A new modular framework, YARN (Yielding Abstractions for Reasoning in Narratives), enhances machines' analogical reasoning in narratives by integrating Large Language Models (LLMs) with structural mapping. Analogical reasoning, crucial for human generalization, has been difficult for machines due to the need for pre-extracted entities in cognitive engines and LLMs' sensitivity to prompt format and surface similarity. YARN addresses this by using LLMs to decompose narratives into units, abstracting these units at four defined levels, and then feeding them to a mapping component for cross-story alignment. Experiments demonstrate that these LLM-derived abstractions consistently improve model performance, achieving competitive or superior results compared to end-to-end LLM baselines. Remaining challenges include optimal abstraction levels and incorporating implicit causality.

Key takeaway

For research scientists developing AI systems for complex reasoning, YARN offers a validated approach to improve analogical capabilities in narrative understanding. You should consider integrating LLM-derived abstractions into your structural mapping pipelines to overcome limitations of surface similarity and enhance generalization, especially when dealing with diverse narrative structures.

Key insights

LLM-derived narrative abstractions significantly improve machine analogical reasoning by enhancing structural mapping.

Principles

Method

YARN uses LLMs to decompose narratives into units, abstract these units at four levels (general meaning, story role), and then maps elements across stories for analogical reasoning.

In practice

Topics

Best for: Research Scientist, AI Scientist, NLP Engineer

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.