Add explicit 'User-agent: <bot>' blocks followed by 'Allow: /' for each major AI crawler: GPTBot, ClaudeBot, PerplexityBot, Google-Extended, Applebot-Extended, Bytespider, Meta-ExternalAgent and CCBot. End the file with a Sitemap directive pointing to your XML sitemap.
User-agent: GPTBot / Allow: / / User-agent: ClaudeBot / Allow: / / ... and so on for each AI crawler. Then User-agent: * / Allow: / / Sitemap: https://example.com/sitemap.xml.
Inheriting overly aggressive bot rules from a CDN config or WAF that block GPTBot accidentally. Forgetting Google-Extended (it is distinct from Googlebot). Not declaring the sitemap in robots.txt.
Use the AI Agent Readiness Check which tests crawler access for all major AI bots and reports any blocks.
Run the free 50-signal AI Agent Readiness Check or book a free scoping call.
Score my site