Robots & crawl control

The Clean-param directive in robots.txt explained

Clean-param is a Yandex-specific robots.txt directive that lists URL query parameters Yandex should ignore when crawling, helping consolidate duplicate URLs. This page explains its syntax, what it does, and why Google relies on different mechanisms.

Verified against primary sources

What Clean-param does

Clean-param is a directive Yandex recognises in robots.txt. It lets you name query-string parameters that do not change page content — such as tracking tags or session identifiers — so Yandex treats URLs differing only in those parameters as the same resource. This reduces duplicate crawling and helps consolidate ranking signals for Yandex.

It is Yandex-specific. Other crawlers, including Googlebot, do not act on Clean-param, so it is not a general-purpose parameter-handling tool.

Yandex-specific robots.txt directive
Names parameters that do not change content
Helps Yandex consolidate duplicate parameterised URLs

Syntax and alternatives

The directive takes a parameter list and an optional path prefix:

Clean-param: utm_source&utm_medium /catalog/

This tells Yandex to ignore those parameters for URLs under /catalog/. Multiple parameters are joined with &, and you can repeat the directive for different paths.

For Google and most other engines, consolidate duplicate parameterised URLs with rel=canonical tags and consistent internal linking instead — Google retired its old URL parameters tool and relies on canonicalisation signals.

How it appears in analytics and logs

Clean-param is a Yandex-specific instruction read from robots.txt. It does not change other crawlers' behavior; for Yandex, it can reduce redundant crawling of parameterised URL variants.

Diagnostic use case

Tell Yandex to ignore tracking or session parameters (like utm_source or sort order) so duplicate parameterised URLs are crawled and indexed as a single canonical resource.

What WebmasterID can help detect

WebmasterID reports crawler hits per URL, so you can see whether parameterised variants are being crawled — context for deciding whether a Clean-param rule for Yandex would help.

Common mistakes

Expecting Google to honour Clean-param — it is Yandex-specific.
Listing parameters that actually change content, causing distinct pages to be merged.
Forgetting the optional path prefix and applying it more broadly than intended.

Privacy and accuracy notes

Clean-param concerns how your own URLs are crawled. It involves no visitor data and is not an access-control mechanism.

↑ All robots topics in Robots & crawl control

Sources and verification notes

Yandex — Clean-param directive documentationDocuments the Clean-param syntax and behavior for Yandex.

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.