Recognising non-named spatial entities in literary texts: a novel spatial entities classifier

· Source: Explosion · Developer tools and consulting for AI, Machine Learning and NLP - Explosion.ai · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Advanced, quick

Summary

A novel spatial entities classifier is presented, specifically designed for recognizing "non-named spatial entities" (NNSE) within literary texts. This research details a case study focused on predicting NNSE in a historical corpus composed of Swiss-German novels. The classification system leverages a deep learning model, which is implemented in conjunction with BERT and Prodigy for enhanced performance. The primary objective is to accurately identify spatial references that are not proper nouns, such as common nouns like "the village square" or "the mountain path," within complex textual data. This development provides a specialized tool for digital humanities, enabling more nuanced analysis of geographical and environmental contexts embedded in historical literary works.

Key takeaway

For NLP Engineers or Digital Humanities Researchers developing custom entity recognition models for literary corpora, this work demonstrates a viable approach for identifying non-named spatial entities. You should consider integrating deep learning models with pre-trained language models like BERT and annotation tools such as Prodigy to tackle complex, domain-specific entity types. This method can significantly enhance the granularity of spatial analysis in historical texts.

Key insights

A novel deep learning classifier identifies non-named spatial entities in Swiss-German novels using BERT and Prodigy.

Method

A deep learning model, integrated with BERT and Prodigy, is used to predict non-named spatial entities (NNSE) in a historical Swiss-German novel corpus.

Topics

Best for: AI Scientist, NLP Engineer, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Explosion · Developer tools and consulting for AI, Machine Learning and NLP - Explosion.ai.