WebmasterID logoWebmasterID
Robots & crawl control

How to block AI2Bot in robots.txt

AI2Bot is a crawler associated with the Allen Institute for AI (AI2), which produces open AI research and datasets. This page gives the robots.txt rule to disallow its token and stays cautious where public documentation is limited, marking unverified specifics rather than guessing.

Partially verified

What AI2Bot is

AI2Bot is a crawler associated with the Allen Institute for AI (AI2), a research organisation known for open models and datasets. Because AI2's outputs are research-oriented and often openly published, blocking AI2Bot reduces your presence in that crawling. Where AI2's public documentation is incomplete, this entry avoids asserting specifics that cannot be confidently sourced.

The rule

To disallow AI2Bot site-wide, target its token:

User-agent: AI2Bot Disallow: /

Treat this as a forward-looking request and confirm in your logs that requests stop. robots.txt is honoured by compliant crawlers but cannot force compliance, and it is not an access-control boundary. Do not invent IP ranges to verify it.

How it appears in analytics and logs

A request carrying the AI2Bot token is a crawler associated with the Allen Institute for AI fetching a URL. After a disallow, confirm in logs whether AI2Bot activity actually stops.

Diagnostic use case

Disallow AI2Bot to keep your content out of crawling associated with AI2's open research datasets.

What WebmasterID can help detect

WebmasterID classifies AI2Bot by its token, separate from human traffic, so you can see whether a disallow rule changed its activity.

Common mistakes

Privacy and accuracy notes

Blocking AI2Bot is a publishing-policy choice in a public file. It involves no visitor data and is not access control.

Related pages

Sources and verification notes

Last reviewed 2026-06-24. Facts are checked against primary/official sources where available; uncertain specifics are marked “Data not yet verified” rather than guessed.