GraphLit: Learning Text-Enriched Dynamic Character Network Representations for Literary Study
Summary
GraphLit is a novel self-supervised learning framework designed for literary study, which learns rich representations of long novels by integrating character interactions with their textual contexts. It introduces Dynamic Heterogeneous Character Networks (DHCNs), a method to organize novels into temporally localized heterogeneous graphs. Researchers extracted approximately 20,000 DHCNs from Project Gutenberg to train GraphLit using a masked graph autoencoder objective. This framework significantly improves performance across 12 character-related tasks, particularly those demanding contextual understanding, surpassing both text-only and graph-only baseline models. Furthermore, GraphLit and DHCNs demonstrate applicability in literary analysis, specifically for examining the relationship between narrative non-linearity and dynamic social features within texts.
Key takeaway
For NLP researchers or digital humanists analyzing complex narrative structures, traditional graph-only or text-only methods may overlook critical textual context. You should consider Dynamic Heterogeneous Character Networks (DHCNs) and the GraphLit framework to integrate character interactions with their surrounding text. This approach provides richer, context-aware representations, enabling deeper insights into dynamic social features and narrative non-linearity in literary texts.
Key insights
GraphLit integrates dynamic character networks with textual context via self-supervised learning to enhance literary analysis.
Principles
- Textual context enriches character interaction graphs.
- Dynamic graphs capture temporal evolution in narratives.
- Self-supervised learning yields rich literary representations.
Method
Novels are organized into Dynamic Heterogeneous Character Networks (DHCNs) as temporally localized graphs. GraphLit then applies a masked graph autoencoder objective on these DHCNs for self-supervised representation learning.
In practice
- Analyze narrative non-linearity in novels.
- Study dynamic social features within texts.
- Improve character-related task performance.
Topics
- GraphLit
- Dynamic Heterogeneous Character Networks
- Literary Analysis
- Self-supervised Learning
- Character Networks
- Project Gutenberg
Code references
Best for: AI Scientist, Research Scientist, Machine Learning Engineer
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Takara TLDR - Daily AI Papers.