Robots & crawl control

How crawlers cache robots.txt

Crawlers do not re-fetch robots.txt on every request — they cache it. This page explains Google's caching window, why your edits take time to take effect, and how caching interacts with HTTP cache headers and fetch failures.

Verified against primary sources

Google's caching window

Google documents that it generally caches robots.txt for up to 24 hours, and may use a cached copy longer if the file becomes unreachable. So a rule you add now may not be enforced for some time, and a rule you remove may keep applying briefly.

You can influence freshness with standard HTTP caching headers (for example a max-age), but treat the cache as a built-in delay: plan robots.txt changes ahead of time rather than expecting instant effect.

Google caches robots.txt for up to ~24 hours
HTTP cache headers can influence freshness
Edits are not honoured instantly

When robots.txt is unreachable

Caching also shapes failure behavior. If a previously successful robots.txt becomes temporarily unreachable (5xx or timeout), Google may keep using the last cached version for a period rather than immediately assuming open or closed crawling. A prolonged failure changes that behavior — see the dedicated pages on redirects and 404 handling.

To prompt a fresh fetch in Google after an important change, you can request a recrawl of robots.txt in Search Console rather than waiting out the cache.

How it appears in analytics and logs

If a crawler keeps following an old rule shortly after you edit robots.txt, caching is the usual cause — the crawler is still using a previously fetched copy.

Diagnostic use case

Set expectations when you change robots.txt — knowing why a new rule is not honoured immediately and how to prompt a faster refresh.

What WebmasterID can help detect

WebmasterID timestamps crawler hits, so after a robots.txt edit you can watch when behavior actually shifts — revealing the cache-refresh lag in practice.

Common mistakes

Expecting a robots.txt edit to take effect immediately.
Ignoring HTTP cache headers that prolong an outdated file.
Assuming a brief robots.txt outage instantly changes crawl behavior.

Privacy and accuracy notes

Caching concerns the robots.txt file itself, not visitors. No personal data is involved in how a crawler stores or refreshes the file.

↑ All robots topics in Robots & crawl control

Sources and verification notes

Google — How Google interprets robots.txt (caching)Documents the up-to-24-hour caching window and unreachable handling.

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.