llms.txt: Created for language models — ignored by agents

· Source: LLM on Medium · Field: Technology & Digital — Artificial Intelligence & Machine Learning, Software Development & Engineering · Depth: Intermediate, medium

Summary

A September 2024 proposal by answer.ai, led by Jeremy Howard, introduced "llms.txt" as a text-based instruction file for language models. Intended to provide concise summaries and references, thereby saving context window space, recent analysis from Limy based on 515 million bot events reveals only 408 inquiries to "llms.txt" files. Despite its initial purpose, generalist agents like ChatGPT, Claude, and Gemini currently do not utilize "llms.txt" for training data, general search, RAG, or direct URL lookups. The primary exception is agentic code tools such as Claude Code, Copilot, Codex, and Cursor, which increasingly use it for navigating complex developer documentation. The lack of broader adoption is attributed to the overhead and increased costs for agent providers, alongside concerns about the file's potential for inaccuracy or manipulation.

Key takeaway

For web developers and content strategists considering "llms.txt" for general websites, you should currently avoid implementing it. Generalist agents like ChatGPT and Claude largely ignore these files, making the effort a waste of resources. Instead, focus your time on creating tidy content structures, using plain language, ensuring universal design, and employing clear semantic HTML. This approach significantly improves how language models interpret and present your content to users.

Key insights

"llms.txt", designed to optimize language model context, is largely ignored by generalist agents but adopted by agentic code tools.

Principles

In practice

Topics

Best for: AI Architect, AI Product Manager, AI Engineer, Software Engineer, Director of AI/ML

Related on AIssential

Open in AIssential →

Editorial summary, takeaway, and curation by AIssential. Original article published by LLM on Medium.