How do crawlers discover new pages?

Primarily by following links, especially internal links, and via sitemaps. A page with no internal links and not in a sitemap is very hard for a crawler to discover.

Does crawl depth affect crawl frequency?

Generally yes. Pages closer to strong entry points like the homepage tend to be crawled more often than pages buried many clicks deep. Flattening structure helps important pages get crawled.

Crawl diagnostics

Internal linking for crawl discovery

Internal links are how crawlers discover and reach pages within a site. Google primarily finds new URLs by following links, so pages with no incoming internal links become orphans that are hard to discover. This page explains crawl depth, link equity flow, and practical patterns — hub pages, breadcrumbs, related links, and crawlable HTML anchors — that keep important pages within easy reach of a crawl.

Verified against primary sources

What this means

Search engines discover most pages by following links, especially internal ones. An internal link is both a discovery path and a relevance/importance signal: pages linked from many places, with descriptive anchor text, are easier to find and clearer in intent.

A page with no internal links pointing to it is an orphan. Even if it is in the sitemap, it is harder to discover and tends to be crawled less. Building deliberate internal links is one of the most direct ways to influence what gets crawled.

Crawl depth and link equity

Crawl depth is the number of clicks from the homepage (or another strong entry point) to a page. Pages reachable in a few clicks tend to be crawled more frequently than pages buried many levels deep. Flattening important sections reduces depth.

Links also pass equity. Links from frequently crawled, well-linked pages help downstream pages get discovered and recrawled. Concentrate internal links toward your priority pages, and avoid burying them behind long pagination chains or interaction-only navigation that crawlers may not traverse.

Crawlers discover URLs mainly by following links
Shallow crawl depth → more frequent crawling
Descriptive anchor text clarifies a target page's topic
Orphan pages (no internal links) are hard to discover

Patterns that aid discovery

Use real HTML anchor elements with href attributes; crawlers follow those, not JavaScript onclick handlers or buttons. Provide hub or category pages that link to their members, breadcrumb trails that expose hierarchy, and related-content links that connect topically.

Keep navigation crawlable without requiring user interaction, and make sure paginated or filtered sections still expose links to the underlying items. Check crawl coverage to confirm priority pages are actually being fetched, and add internal links to any that are missed.

How it appears in analytics and logs

If a page has few or no internal links, crawlers struggle to discover it and may crawl it rarely or not at all. Deep pages many clicks from the homepage are crawled less often. Strong, crawlable internal links signal importance and improve the chance a page is found and recrawled.

Diagnostic use case

Improve discovery of deep or new pages, fix orphan pages with no internal links, and flatten excessive crawl depth so crawlers reach priority content efficiently.

What WebmasterID can help detect

WebmasterID shows which URLs crawlers actually reach and how often, helping you spot pages crawlers never fetch — a strong hint that they are orphaned or buried too deep in the internal link graph.

Common mistakes

Relying on JavaScript click handlers or buttons instead of crawlable <a href> links.
Leaving important pages orphaned with no internal links pointing to them.
Burying key pages many clicks deep behind long pagination or filter chains.
Using vague anchor text (click here) that gives crawlers no topical signal.

Privacy and accuracy notes

Internal linking is a site-structure concern, independent of who visits. WebmasterID records crawler traversal as bot events; it does not track individual human navigation paths or build visitor profiles.

Frequently asked questions

How do crawlers discover new pages?: Primarily by following links, especially internal links, and via sitemaps. A page with no internal links and not in a sitemap is very hard for a crawler to discover.
Does crawl depth affect crawl frequency?: Generally yes. Pages closer to strong entry points like the homepage tend to be crawled more often than pages buried many clicks deep. Flattening structure helps important pages get crawled.

↑ All diagnostic topics in Crawl diagnostics

Sources and verification notes

Google Search Central — How Google Search works (crawling)Discovery via links and sitemaps.
Google Search Central — Crawling and indexing topics

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.