WebmasterID logoWebmasterID
Robots & crawl control

How to opt out of Apple AI with Applebot-Extended

Applebot-Extended is a robots.txt token Apple provides so site owners can opt out of having content used to train Apple's generative AI models. It is a standalone control: disallowing Applebot-Extended does not stop Applebot, which keeps crawling for Apple search features and Siri.

Verified against primary sources

What Applebot-Extended is

Applebot-Extended is a robots.txt token Apple introduced so publishers can control whether their content is used to train Apple's generative AI models. It is layered on top of the existing Applebot crawler rather than being a separate fetcher.

Applebot itself is the user agent Apple uses for Search and Siri. Setting an Applebot-Extended rule governs only the AI-training use of content Applebot has already fetched.

The rule and what it does not touch

To opt out of generative AI training site-wide:

User-agent: Applebot-Extended Disallow: /

Apple documents that this does not stop Applebot from crawling for search and Siri — control that via the separate Applebot token. Because Applebot-Extended is not a fetcher, you will not see it as a user agent in logs.

How it appears in analytics and logs

Applebot-Extended is a policy token, not a fetcher you will see by that name in logs. Disallowing it signals an AI-training opt-out; Applebot is the user agent that continues to appear.

Diagnostic use case

Opt out of Apple's generative-AI training use of your content while keeping Applebot crawling for Apple search and Siri intact.

What WebmasterID can help detect

WebmasterID keeps Applebot and AI-control activity distinct in your view, so you can confirm opting out of Applebot-Extended did not disturb normal Applebot crawling.

Common mistakes

Privacy and accuracy notes

This is a content-usage policy choice in a public file. It does not involve visitor data and is not a security control.

Related pages

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.