WebmasterID logoWebmasterID
Search bots

Censys and Shodan scanning crawlers

Censys and Shodan are internet-wide scanning services that map reachable hosts, open ports, and exposed services for security research and asset discovery. They are not search-engine crawlers indexing your content for ranking; they probe infrastructure. Their requests appear in logs as scanning activity from their published scanner identities, and they offer opt-out mechanisms for operators.

Partially verified

What this means

Censys and Shodan continuously scan the public internet to build searchable indexes of hosts and services — what ports are open, what banners services return, and which devices are reachable. Security teams use these to find their own exposed assets; researchers use them to study the internet at large.

This is fundamentally different from a search crawler. Censys and Shodan are not reading your articles to rank them; they are characterising your infrastructure's externally visible surface.

How they identify themselves

Both services publish information about their scanning and provide opt-out paths. Their probes can carry self-identifying user-agents or originate from documented scanner infrastructure, and each publishes guidance on recognising and excluding their scans.

Because exact tokens and IP ranges vary across their scanner fleets and change over time, this entry is marked partially verified. The reliable signal is the scanning behaviour plus each provider's published scanner documentation; corroborate with their opt-out and identification pages.

What to do about it

If you do not want to appear in these indexes, follow each provider's documented opt-out or exclusion process rather than relying on robots.txt, which governs content crawlers and does not control port scanning.

Treat scanning as a prompt to review your external exposure: if a service is reachable and indexed by a scanner, it is reachable by anyone. The scan itself is a visibility signal, not an attack.

How it appears in analytics and logs

A Censys or Shodan request is an infrastructure scan probing reachable services, not a content crawl for search. It is automated scanning traffic and should be treated as bot/recon activity, not audience or SEO crawling.

Diagnostic use case

Recognise internet-scanning probes from Censys and Shodan in logs, distinguish them from search indexing and SEO crawlers, and decide whether to request exclusion.

What WebmasterID can help detect

WebmasterID classifies recognised scanner probes server-side as bot/monitoring traffic and shows which endpoints they reached, so security scanning does not blend into human analytics or search-crawl coverage.

Common mistakes

Privacy and accuracy notes

Scanner identification uses the request user-agent and published scanner information only. No visitor identity is involved. WebmasterID records the probe as a bot event, separate from human analytics.

Related pages

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.