AI Crawlers & Bots

CCBot

Definition

CCBot is Common Crawl's crawler. Common Crawl is the largest publicly-available web corpus and is used as training data by virtually every major LLM including those at OpenAI, Anthropic, Meta and Google. Blocking CCBot removes the site from this foundational training data source.

← Back to full glossary

Want help shipping AEO into your site?

Run the free 50-signal AI Agent Readiness Check or talk to our AEO team.

Score my site