WebmasterID logoWebmasterID
User agents

User agents and bot scores

Bot-detection and WAF products often compute a bot score that estimates how likely a request is automated. The user agent is one input, but a weak one on its own: it is trivially spoofable and frequently blank or generic on legitimate clients. Over-weighting the UA leads to both missed bots and blocked humans.

Partially verified

What this means

A bot score is a probabilistic estimate that a request came from automation rather than a human, used by WAFs and anti-bot products to decide whether to allow, challenge, or block. These systems blend many inputs, and the user agent is one of them.

On its own the user agent is a weak signal. It is self-reported and trivially editable, so a scraper can present a perfect browser string while a legitimate API client honestly names its library. Treating the UA as decisive distorts the score.

Using the UA proportionately

The user agent is most useful in combination: a library token plus datacenter origin plus non-browser request patterns together make a confident automation signal, whereas any one alone does not. Conversely, a perfect browser UA paired with impossible navigation timing suggests spoofing.

Because exact scoring formulas are proprietary and vary by vendor, we do not assert specific weightings here. The durable principle is to corroborate the user agent with network and behavioural signals rather than letting the string dominate, and to keep an explainable basis for any block.

How it appears in analytics and logs

A bot score derived heavily from the user agent will rate a spoofed browser string as human and an honest library token as bot. The UA contributes signal but cannot carry a scoring decision alone.

Diagnostic use case

Understand the user agent's limited role in bot scoring, avoid over-weighting it, and combine it with behavioural and network signals for better accuracy.

What WebmasterID can help detect

WebmasterID classifies traffic deterministically from the user agent and corroborating signals rather than emitting an opaque probability, so you can see exactly why a request was labelled a crawler, automation, or human.

Common mistakes

Privacy and accuracy notes

Bot scoring should rest on request and behavioural signals, not on identifying the person behind a request. The user agent is coarse client metadata, and WebmasterID keeps bot classification deterministic and non-identifying.

Related pages

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.