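CCBot is Common Crawl's web crawler. Common Crawl publishes one of the largest openly available web corpora, and it is widely used, directly or through filtered derivatives, as LLM training data, reportedly including for models from OpenAI, Anthropic, Meta and Google. Blocking CCBot in robots.txt keeps a site's pages out of future Common Crawl snapshots and out of training sets built from them; pages captured in earlier crawls stay in the archives already published.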
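To see whether a given robots.txt would block CCBot, a quick check along these lines may help. This is a minimal sketch using Python's standard `urllib.robotparser`; the robots.txt content, the `example.com` URL, and the `is_allowed` helper are illustrative assumptions, and the result reflects how Python's parser reads the rules, not necessarily how CCBot itself interprets them.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that blocks only Common Crawl's crawler
# and leaves every other user agent unrestricted.
ROBOTS_TXT = """\
User-agent: CCBot
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(robots_txt: str, user_agent: str, url: str) -> bool:
    """Return True if `user_agent` may fetch `url` under the given rules."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network request is made.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/pricing"  # placeholder URL
    print("CCBot allowed:", is_allowed(ROBOTS_TXT, "CCBot", url))
    print("Other bots allowed:", is_allowed(ROBOTS_TXT, "SomeOtherBot", url))
```

To test a live site instead, the same parser can load the file with `set_url(...)` followed by `read()`, assuming the site serves a robots.txt at its root.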