NeuralCrawl

πŸ‡ΊπŸ‡Έ Claude

claude.ai · Top 1000 websites · rank #52 · LLM · live robots.txt ↗

AI crawler access (latest snapshot, 1h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 408 bytes · sha256 7edaaba79d5d · raw

User-Agent: ia_archiver
Allow: /$
Allow: /login
Disallow: /

User-Agent: GPTBot
User-Agent: OAI-SearchBot
User-Agent: ChatGPT-User
Disallow: /

User-Agent: *
Disallow: /new?*
Disallow: /chat/*
Disallow: /share/*
Disallow: /join/*
Disallow: /magic-link*
Disallow: /api/*
Disallow: /onboarding*
Disallow: /upgrade*
Disallow: /lti/*
Disallow: /settings*
Disallow: /task*

Sitemap: https://claude.ai/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived