WebmasterID logoWebmasterID
Robots & crawl control

How to block the SISTRIX crawler

SISTRIX runs a crawler to gather SEO visibility and ranking data for its platform. This page shows how to disallow the SISTRIX crawler in robots.txt, how to throttle it instead of blocking, and how to confirm the directive is honoured.

Verified against primary sources

robots.txt rule

SISTRIX documents the token its crawler uses. To disallow it site-wide:

User-agent: SISTRIX Crawler Disallow: /

Confirm the exact token from your access logs, because SISTRIX has used more than one identifier over time. SISTRIX documents that its crawler respects robots.txt, so a correctly targeted Disallow is the supported opt-out.

Throttle as an alternative

If you would rather slow the SISTRIX crawler than block it, SISTRIX documents crawl-delay support:

User-agent: SISTRIX Crawler Crawl-delay: 10

This asks the crawler to space out its requests. Keep the directive inside the SISTRIX group; crawl-delay is honoured only by crawlers that support it and is ignored by Googlebot.

How it appears in analytics and logs

SISTRIX crawler hits mean an SEO analytics platform is collecting data about your pages. It is third-party tooling, not search indexing, and brings no organic referral traffic.

Diagnostic use case

Prevent the SISTRIX crawler from consuming crawl resources or mapping your site when you do not use the SISTRIX toolbox.

What WebmasterID can help detect

WebmasterID classifies the SISTRIX crawler as an SEO crawler, so you can verify a block takes effect and keep these requests out of human analytics.

Common mistakes

Privacy and accuracy notes

The rule matches the SISTRIX crawler token only. No visitor data is involved, and robots.txt is a request to compliant crawlers rather than an access control.

Related pages

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.