Crawl diagnostics

Orphan pages diagnosis

An orphan page is one that no internal link points to. Crawlers discover pages mainly by following links, so an orphan is hard to find — it may exist only in a sitemap or be effectively invisible. Diagnosing orphans means comparing all known URLs against your internal link graph and fixing the gap with links.

Verified against primary sources

What an orphan page is

Crawlers discover content primarily by following links from pages they already know. An orphan page is one that has no internal links pointing to it, so there is no link path for a crawler to find it. It might still be listed in an XML sitemap, but a sitemap aids discovery rather than guaranteeing the steady crawling that well-linked pages get.

Orphans often arise from removed navigation, deleted category pages, content published without being linked, or pages reachable only through search or JavaScript-only controls.

Finding and fixing orphans

To diagnose orphans, compare your full set of known URLs — from the CMS, sitemap, or a crawl that includes the sitemap — against the set of URLs reachable by following internal links. Pages in the first set but not the second are orphans.

Fix them by adding internal links from relevant, crawlable pages: category or hub pages, related content, or navigation. Ensure those links are real anchors with href, not JavaScript-only handlers. Sitemaps help but should complement internal linking, not replace it. Once linked, an orphan becomes discoverable and is crawled like any other page.

Orphan = a URL with no internal links pointing to it
Compare known URLs against link-reachable URLs to find them
Fix with real anchor links from relevant pages; sitemaps complement

Operator checklist

Build a list of all known URLs and a list of link-reachable URLs, then find the difference. Link orphans from relevant hub, category, or related-content pages using crawlable anchors. Keep sitemaps current as a supplement. Re-check that newly linked pages start getting crawled.

How it appears in analytics and logs

An orphan page receives no internal links, so crawlers relying on link-following may not discover it or may crawl it rarely. Its absence from the internal link graph — while present in a sitemap or known URL set — is the diagnostic signal.

Diagnostic use case

Find pages with no internal links and restore their discoverability by linking them from relevant hub or category pages.

What WebmasterID can help detect

WebmasterID can show which URLs crawlers actually reach, helping you spot pages that exist but are rarely or never crawled — a signal they may be orphaned in your link structure.

Common mistakes

Relying on a sitemap alone to make pages discoverable.
Publishing content without linking it from any existing page.
Linking only via JavaScript-only controls that produce no crawlable URL.

Privacy and accuracy notes

Orphan diagnosis concerns the internal link graph and URL inventory, not personal data. WebmasterID reports crawl and discovery patterns without exposing individual visitors.

↑ All diagnostic topics in Crawl diagnostics

Sources and verification notes

Google Search Central — How Google Search crawls pagesDocuments link-following as the primary discovery mechanism.
Google Search Central — Sitemaps overview

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.