NeuralCrawl

🇺🇸 seoClarity

seoclarity.net · SEO & AI search · rank #17 · SEO software

AI crawler access (latest snapshot, 5h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot
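
The "inherited from the * wildcard group" legend entry can be illustrated with a small sketch of group selection: a crawler uses the group that names its own user-agent token and otherwise falls back to the `*` group. This is a simplified reading of the Robots Exclusion Protocol, not NeuralCrawl's implementation. The helper names `parse_groups` and `select_group` are hypothetical, and matching is assumed to be an exact, case-insensitive token comparison.

```python
def parse_groups(robots_txt: str) -> dict[str, list[tuple[str, str]]]:
    """Map each lowercase user-agent token to its (directive, path) rules."""
    groups: dict[str, list[tuple[str, str]]] = {}
    current_agents: list[str] = []
    in_rules = False  # True once the current group has seen a rule line
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if in_rules:
                # A user-agent line after rules starts a new group.
                current_agents = []
                in_rules = False
            current_agents.append(value.lower())
            groups.setdefault(value.lower(), [])
        elif field in ("allow", "disallow"):
            in_rules = True
            for agent in current_agents:
                groups[agent].append((field, value))
    return groups


def select_group(groups: dict[str, list[tuple[str, str]]], agent: str):
    """Return (group key, inherited flag) for a crawler token.

    Assumption: exact case-insensitive match, then fallback to "*".
    """
    key = agent.lower()
    if key in groups:
        return key, False
    if "*" in groups:
        return "*", True  # inherited from the wildcard group
    return None, False
```

With a file that contains only a `User-agent: *` group, `select_group(groups, "GPTBot")` returns `("*", True)`, which corresponds to a faded (inherited) entry in the list above.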

Current robots.txt 431 bytes · sha256 ea31e59a008c

User-agent: *
Disallow: /thank-you
Disallow: /thank-you/
Disallow: /thank-you*
Disallow: /*thank-you$
Allow: /seo-platform-essentials/confirmation-lead/thank-you
Disallow: /competitive-insights-report-3/
Disallow: /see-it-in-action-enterprise-v1
Disallow: /workflow-video-library
Disallow: /_hcms/preview/
Disallow: /hs/manage-preferences/
Disallow: /hs/preferences-center/
Disallow: /*?*hs_preview=*
Disallow: /*?*hsCacheBuster=*
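
How these rules combine can be sketched with a small matcher. It assumes the common interpretation in which `*` matches any character sequence, a trailing `$` anchors the end of the path, the longest matching pattern wins, and `Allow` wins a tie. The function names are hypothetical, and only a subset of the rules above is included for brevity.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern[str]:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal parts and turn each "*" into ".*".
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(rules: list[tuple[str, str]], path: str) -> bool:
    """Longest matching pattern wins; Allow wins ties; no match means allowed."""
    best_len, best_allow = -1, True
    for directive, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


# Subset of the wildcard group shown above.
RULES = [
    ("disallow", "/thank-you"),
    ("disallow", "/thank-you/"),
    ("disallow", "/thank-you*"),
    ("disallow", "/*thank-you$"),
    ("allow", "/seo-platform-essentials/confirmation-lead/thank-you"),
    ("disallow", "/_hcms/preview/"),
    ("disallow", "/*?*hs_preview=*"),
]

for path in (
    "/thank-you",
    "/demo/thank-you",
    "/seo-platform-essentials/confirmation-lead/thank-you",
    "/pricing?hs_preview=abc",
    "/blog/",
):
    print(path, "allowed" if is_allowed(RULES, path) else "disallowed")
```

Under these assumptions the `Allow` line takes precedence for its exact URL because it is longer than `/*thank-you$`, while any other path ending in `thank-you` stays disallowed.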

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived