Natalie Buda Smith on AI in libraries, human at the center, deeper storytelling, and language recreation (AC Ep47)
Summary
Natalie Buda Smith, Director of AI at the Library of Congress, discusses how digital interfaces and AI are transforming access to human knowledge and cultural memory. She highlights the shift from direct primary source access to AI-intermediated information, emphasizing the importance of preserving historical context through multiple digitization versions, such as keeping 1970s scans alongside higher-quality modern ones. Smith addresses challenges like navigating proprietary data versus open APIs and the Library's approach of empowering staff with AI tools for tasks like language translation and transcribing handwritten historical texts. Collaborative, AI-powered projects, including the "Revolution Crossroads" initiative with the Smithsonian and National Archives, are surfacing new connections and enabling creative storytelling across diverse collections, like recreating a lost tribal language from an old Bible. The Library manages over 179 million physical items and provides public access to nearly 250 years of legislative history via APIs.
Key takeaway
For research scientists or AI/ML directors managing large historical or cultural datasets, you should prioritize human-centered AI deployment to empower staff and enhance public access. Consider making your digitized collections available via well-documented APIs to foster external innovation and deeper storytelling. This approach allows for personalized information delivery and the discovery of new connections, ensuring historical context is preserved even with AI intermediation.
Key insights
AI and digital interfaces are revolutionizing access to cultural memory by enabling personalized delivery and deeper contextual understanding.
Principles
- Preserve multiple digitization versions for historical context.
- Prioritize human-centered design in AI tool deployment.
- Public data accessibility fosters creative and research applications.
Method
The Library of Congress empowers staff with AI tools for translation and OCR, and collaborates externally to develop AI applications for large, diverse datasets, making them publicly available via APIs and platforms like Hugging Face.
In practice
- Use AI services to translate historical documents in unfamiliar languages.
- Apply AI for OCR on handwritten historical materials.
- Explore public APIs for large-scale data analysis and creative projects.
Topics
- AI in Libraries
- Digital Humanities
- Cultural Heritage Preservation
- Data Intermediation
- Public Sector AI
- API Accessibility
Best for: NLP Engineer, Director of AI/ML, Research Scientist, Domain Expert
Related on AIssential
Editorial summary, takeaway, and curation by AIssential. Original article published by Humans + AI.