AI crawlers

Bytespider — ByteDance crawler

Bytespider is a web crawler affiliated with ByteDance. Its robots.txt token is Bytespider, and it appears in server logs as an automated fetcher. Public documentation is limited, so some specifics about its purpose and behaviour are marked partially verified rather than guessed.

Partially verified

What this means

Bytespider is a crawler affiliated with ByteDance. It appears in server logs as an automated fetcher carrying the Bytespider token.

Public documentation about Bytespider is comparatively limited. For that reason, this entry describes the stable identification pattern and avoids asserting specifics — such as exact crawl purpose or scope — that cannot be confidently sourced. Treat any such specifics you read elsewhere with caution.

How Bytespider identifies itself

Bytespider uses the robots.txt user-agent token Bytespider. Its user-agent string contains that token. Match on the stable token rather than a full version string.

The user agent is a claim and can be copied. Because verification guidance is not clearly published, do not assume IP-range authenticity, and do not invent IP ranges. Identify it by the token and treat trust-sensitive decisions conservatively.

robots.txt token: Bytespider
Affiliated with ByteDance
Specifics beyond the token: not fully documented publicly

robots.txt considerations

To disallow Bytespider site-wide, target its token:

User-agent: Bytespider Disallow: /

Whether and how strictly Bytespider honours robots.txt is not as clearly documented as for major crawlers, so this rule is best treated as a request rather than a guarantee. robots.txt is never an access-control boundary.

How it appears in analytics and logs

A request carrying the Bytespider token is a ByteDance-affiliated crawler fetching a URL — a bot event, not a human visit. Because public docs are limited, treat claims about its exact purpose conservatively and rely on the token for identification.

Diagnostic use case

Identify Bytespider activity in logs by its token and decide robots.txt policy, while treating undocumented specifics with caution.

What WebmasterID can help detect

WebmasterID classifies Bytespider server-side by its token and surfaces it on the bot-intelligence surface as an AI/dataset crawler, so you can see its activity per page without parsing logs.

Common mistakes

Assuming documented behaviour exists where public docs are actually sparse.
Inventing IP ranges to verify Bytespider — none should be fabricated.
Counting crawler hits as human sessions.

Privacy and accuracy notes

Detection uses only the request user-agent. No human identity is involved. WebmasterID records the crawl as a bot event, separate from human analytics, and never attaches it to a visitor profile.

↑ All AI crawlers in AI crawlers

Sources and verification notes

ByteDance — crawler reference (token observed in logs)Token Bytespider is observed; comprehensive official docs are limited, so some specifics are marked partially verified.

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.