User agents

Spoofed and fake user agents: what to watch for

Spoofing a user agent is trivial — any client can claim to be Googlebot or a normal browser. This page explains why spoofing happens, the common fake-crawler patterns, and the verification methods that turn a claimed identity into a confirmed one.

Verified against primary sources

Why user agents get spoofed

Clients spoof user agents to blend in, bypass naive blocks, or impersonate a trusted crawler so a site serves them content it would not serve a scraper. Because the string is fully client-controlled, spoofing requires no special tooling.

How to verify instead of trust

The major crawlers publish verification methods. The two reliable approaches are reverse DNS (the IP resolves into the crawler's official domain, with a matching forward lookup) and matching the source IP against the operator's published IP ranges. A user agent that claims a crawler but fails both is not that crawler.

Reverse DNS into the official crawler domain, then forward-confirm
Match the IP against published crawler ranges
Never grant trust on the user-agent string alone

Fake browsers vs fake crawlers

Two patterns dominate: scrapers wearing a normal browser string to look human, and clients wearing a crawler string (often 'Googlebot') to get crawler treatment. The first inflates human metrics; the second can leak content. Both are handled the same way — verify, and categorise the unverifiable honestly.

How it appears in analytics and logs

A request whose user agent claims a major crawler but fails reverse-DNS or IP verification is spoofed. Treat it as unverified automation, not as the crawler it names.

Diagnostic use case

Spot requests that claim to be a trusted crawler but are not, before you grant them special treatment or read them as real search activity.

What WebmasterID can help detect

WebmasterID categorises by user agent and, for crawlers that support it, can reflect verification status, so a fake Googlebot is not silently counted as the real one.

Common mistakes

Granting fake Googlebot the access or content you reserve for the real one.
Counting browser-spoofing scrapers as human visits.
Blocking by user-agent substring, which spoofers simply change.

Privacy and accuracy notes

Verification uses network-level signals (reverse DNS, published IP ranges), not visitor identity. WebmasterID records the verification outcome as a bot signal and never builds a human profile from it.

↑ All user-agent families in User agents

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.