WebmasterID logoWebmasterID
Robots & crawl control

llms.txt basics

llms.txt is a community-proposed convention for a plain-text file that helps large language models find and read your most relevant content. It complements robots.txt rather than replacing it, and like robots.txt it is a hint that cooperating tools may follow — not an enforced control.

Partially verified

What llms.txt is

llms.txt is a convention published at llmstxt.org proposing a Markdown-formatted file at your site root that points language models at your most useful content in a clean, low-noise form. The idea is that LLM-based tools, which work best with concise context, can read a curated map rather than parsing full HTML pages.

It is a proposal and an emerging convention, not an official standard like RFC 9309. Adoption by AI tools is voluntary and still developing, so treat it as an optional supplement.

How it relates to robots.txt

llms.txt and robots.txt serve different goals. robots.txt expresses crawl permissions; llms.txt offers a content map for LLMs. They are complementary — you can publish both. Neither is an enforcement mechanism: just as a non-compliant crawler can ignore robots.txt, no tool is obliged to read or honour llms.txt.

Because adoption and behaviour are still evolving, avoid relying on llms.txt for any control you actually need to guarantee.

How it appears in analytics and logs

llms.txt is a discovery aid for cooperating LLM tools. Its presence does not change crawl permissions and does not guarantee any tool will read it.

Diagnostic use case

Offer LLM-based tools a curated, plain-text map of your key pages, while understanding that adoption is voluntary and evolving.

What WebmasterID can help detect

WebmasterID shows which AI crawlers and assistants reach your pages, so you can observe activity around your content regardless of whether tools consume an llms.txt file.

Common mistakes

Privacy and accuracy notes

An llms.txt file is public, like robots.txt. Only list content you are happy to expose; it is not a privacy or access-control mechanism.

Related pages

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.