Archivists Turn to LLMs to Decipher Handwriting at Scale

2026-05-13 · Source: IEEE Spectrum · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Novice, medium

Summary

General-purpose large language models (LLMs) such as GPT-4 and Gemini are markedly improving the transcription of historical handwritten documents, a task that has long challenged AI researchers. A study by Mark Humphries and colleagues at Wilfrid Laurier University, published in May 2025 in "Historical Methods," found that LLMs outperformed specialized handwriting recognition software such as Transkribus in accuracy, speed, and cost on 18th- and 19th-century English-language documents. LLM-based approaches achieved character error rates (the share of characters transcribed incorrectly) below 2%, compared with 8% for Transkribus, while running 50 times faster at one-fiftieth of the cost. The result is that previously inaccessible archival collections are becoming searchable, which opens new research questions for scholars and family historians. Institutions including the University of North Carolina at Chapel Hill and the Federal Reserve Bank of Philadelphia are already adopting the approach.
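To make the character error rate figures above concrete, here is a minimal sketch of how such a rate is commonly computed: the edit distance between a trusted reference transcription and the model's output, divided by the length of the reference. The sample strings are invented for illustration and are not taken from the study, and the study's exact scoring procedure may differ.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: fewest insertions, deletions, and substitutions."""
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_char != hyp_char)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance as a fraction of the reference length."""
    if not reference:
        raise ValueError("reference transcription must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)


# Invented example: the hypothesis swaps two letters in "Received".
reference = "Received of Mr. Smith ten pounds"
hypothesis = "Recieved of Mr. Smith ten pounds"
print(f"{character_error_rate(reference, hypothesis):.2%}")  # 6.25%
```

Two wrong characters out of 32 give 6.25%, so a rate below 2% corresponds to fewer than one error in every 50 characters.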

Key takeaway

For archivists and historians managing large collections of handwritten documents, highly capable LLMs such as GPT-4 and Gemini fundamentally change the economics and feasibility of transcription. Consider integrating these general-purpose models to digitize previously inaccessible materials quickly and make them searchable, at a fraction of the cost and processing time of specialized software or manual transcription. The shift opens new research avenues and broadens access beyond professional researchers.

Key insights

General-purpose LLMs now reliably transcribe historical handwriting, outperforming specialized software in speed, cost, and accuracy.

Principles

General methods leveraging computation often outperform specialized ones.
Vast training data enables LLMs to infer complex patterns like handwriting.

Method

Feed handwritten document images to LLMs (e.g., GPT-4, Gemini) for transcription. This method leverages the models' broad training to interpret diverse handwriting styles and even tabular structures.
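As a rough sketch of the method, the snippet below packages a scanned page and a transcription instruction into a single request. The payload layout, the field names, and the prompt wording are all assumptions made for illustration. They do not match any particular vendor's API, so the actual request format must come from the provider's documentation.

```python
import base64
import json

# Assumed prompt wording; the article does not specify one.
PROMPT = (
    "Transcribe the handwriting in this image exactly as written. "
    "Preserve line breaks and original spelling. "
    "Mark unreadable words as [illegible]."
)


def build_transcription_request(image_bytes: bytes,
                                media_type: str = "image/jpeg") -> dict:
    """Bundle an image and a prompt into a hypothetical request payload."""
    encoded = base64.b64encode(image_bytes).decode("ascii")
    return {
        "prompt": PROMPT,
        "image": {"media_type": media_type, "data_base64": encoded},
    }


# Placeholder bytes stand in for a real scan read from disk.
request = build_transcription_request(b"placeholder image bytes")
print(json.dumps(request)[:80])
```

Base64 encoding is a common way to embed image data in a JSON body, which is why it is used here.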

In practice

Use LLMs for bulk transcription of diverse historical documents.
Apply LLMs to extract data from complex historical ledgers.
Explore tools like Archive Pearl for democratized access to AI transcription.
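For the ledger use case above, one possible workflow is to ask the model to return each page as comma-separated text and then check that output before loading it into a database. The sketch below assumes that output format. The column names and sample rows are invented for illustration.

```python
import csv
import io


def parse_ledger_rows(transcription: str, expected_columns: list[str]) -> list[dict]:
    """Parse CSV-style model output and reject rows with the wrong shape."""
    reader = csv.reader(io.StringIO(transcription.strip()))
    header = next(reader, None)
    if header is None or [cell.strip() for cell in header] != expected_columns:
        raise ValueError(f"unexpected header: {header}")
    rows = []
    for line_number, fields in enumerate(reader, start=2):
        if not fields:
            continue  # skip blank lines in the model output
        if len(fields) != len(expected_columns):
            raise ValueError(f"line {line_number}: expected "
                             f"{len(expected_columns)} fields, got {len(fields)}")
        rows.append(dict(zip(expected_columns, (f.strip() for f in fields))))
    return rows


# Invented sample of what a transcribed ledger page might look like.
sample = """date,name,amount
1793-04-02,John Miller,12
1793-04-05,Ann Carter,7
"""
for row in parse_ledger_rows(sample, ["date", "name", "amount"]):
    print(row)
```

Failing loudly on a malformed row sends that page to a human reviewer instead of letting a misread table enter the dataset unnoticed.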

Topics

Large Language Models
Handwriting Recognition
Archival Digitization
Historical Research
Generative AI

Best for: NLP Engineer, AI Scientist, Research Scientist, Domain Expert, General Interest

Editorial summary, takeaway, and curation by AIssential. Original article published by IEEE Spectrum.