AI-assisted cultural heritage dissemination: Comparing NMT and glossary-augmented LLM translation in rock art documents

· Source: Computation and Language · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Data Science & Analytics · Depth: Advanced, quick

Summary

A study compared three English machine translation (MT) setups for a Spanish academic rock art text to address challenges in multilingual cultural heritage dissemination. The setups included DeepL as a Neural Machine Translation (NMT) baseline, Gemini-Simple using a Large Language Model (LLM) with a basic prompt, and Gemini-RAG, which augmented the same LLM with glossary-based prompting via term-pair retrieval. Human evaluation, conducted using multi-way Direct Assessment (0-100) and targeted terminology auditing with an MQM taxonomy, revealed that Gemini-RAG achieved the highest exact-match terminology accuracy at 81.4%, significantly outperforming Gemini-Simple (69.1%) and DeepL (64.4%). Gemini-RAG also maintained high overall quality with a mean Direct Assessment score of 85.3, comparable to Gemini-Simple's 85.2, and superior to DeepL's 80.3. These findings suggest that glossary-augmented prompting is an effective, low-overhead method for enhancing terminology control in specialized translation.

Key takeaway

For cultural heritage institutions translating specialized documents, integrating glossary-augmented prompting with LLMs can dramatically improve terminology accuracy and overall translation quality. You should prioritize maintaining concise terminology resources and establishing lightweight evaluation procedures to implement this low-overhead method effectively, ensuring consistent and accurate dissemination of complex information to a global audience.

Key insights

Glossary-augmented LLM prompting significantly improves terminology accuracy in specialized translation without sacrificing overall quality.

Principles

Method

The study compared DeepL NMT, a basic LLM prompt (Gemini-Simple), and a glossary-augmented LLM prompt (Gemini-RAG) for Spanish-to-English rock art text translation, evaluated by human direct assessment and terminology auditing.

In practice

Topics

Best for: AI Engineer, Machine Learning Engineer, NLP Engineer, AI Scientist, Research Scientist

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by Computation and Language.