Archivists Turn to LLMs to Decipher Handwriting at Scale

· Source: IEEE Spectrum · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Novice, medium

Summary

General-purpose AI models, specifically large language models (LLMs) like GPT-4 and Gemini, are significantly improving the ability to transcribe historical handwritten documents, a task that has historically challenged AI researchers. A study by Mark Humphries and colleagues at Wilfrid Laurier University, published in May 2025 in "Historical Methods," demonstrated that LLMs outperformed specialized handwriting recognition software like Transkribus in accuracy, speed, and cost on 18th and 19th-century English-language documents. LLM-based approaches achieved character error rates below 2% compared to Transkribus's 8%, while being 50 times faster and 1/50th the cost. This advancement is making previously inaccessible archival collections searchable, enabling new research questions for scholars and family historians, and is being adopted by institutions like the University of North Carolina at Chapel Hill and the Federal Reserve Bank of Philadelphia.

Key takeaway

For archivists and historians managing large collections of handwritten documents, the emergence of highly capable LLMs like GPT-4 and Gemini fundamentally changes the economics and feasibility of transcription. You should explore integrating these general-purpose AI models to rapidly digitize and make searchable previously inaccessible materials, significantly reducing costs and processing times compared to traditional specialized software or manual methods. This shift enables new research avenues and democratizes access for a broader audience.

Key insights

General-purpose LLMs now reliably transcribe historical handwriting, outperforming specialized software in speed, cost, and accuracy.

Principles

Method

Feed handwritten document images to LLMs (e.g., GPT-4, Gemini) for transcription. This method leverages the models' broad training to interpret diverse handwriting styles and even tabular structures.

In practice

Topics

Best for: NLP Engineer, AI Scientist, Research Scientist, Domain Expert, General Interest

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by IEEE Spectrum.