NeuralCrawl

🇺🇸 xAI

x.ai · Top 1000 websites · rank #6 · AI Chatbots and Tools

AI crawler access (latest snapshot, 13h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot
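
The legend separates crawlers that have their own robots.txt group from those that fall back to the * wildcard group. The following is a minimal sketch of how that distinction, and the resulting path access, could be computed from the group rules in the robots.txt on this page. It assumes RFC 9309 semantics (the longest matching path rule wins, and Allow wins a tie) and does not handle `*` or `$` wildcards inside paths. The function names `parse_groups`, `rules_for`, and `can_fetch` are hypothetical, and nothing here is confirmed to be how NeuralCrawl derives its own labels.

```python
# Group rules copied from the robots.txt shown on this page
# (Sitemap and Content-Signal lines omitted for brevity).
ROBOTS_TXT = """\
User-agent: *
Allow: /
Disallow: /tools/

User-agent: GPTBot
User-agent: ChatGPT-User
User-agent: PerplexityBot
User-agent: ClaudeBot
User-agent: Google-Extended
User-agent: Applebot-Extended
Allow: /
Disallow: /tools/
"""


def parse_groups(text):
    """Return a list of (agents, rules) groups; rules are (kind, path) pairs."""
    groups = []
    agents, rules = [], []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments
        if ":" not in line:
            continue
        key, value = (part.strip() for part in line.split(":", 1))
        key = key.lower()
        if key == "user-agent":
            if rules:  # a User-agent line after rules starts a new group
                groups.append((agents, rules))
                agents, rules = [], []
            agents.append(value.lower())
        elif key in ("allow", "disallow") and agents:
            rules.append((key, value))
    if agents:
        groups.append((agents, rules))
    return groups


def rules_for(groups, agent):
    """Return (rules, inherited): the agent's own group, else the * group."""
    agent = agent.lower()
    named = [rules for agents, rules in groups if agent in agents]
    if named:
        return [r for rules in named for r in rules], False
    wild = [rules for agents, rules in groups if "*" in agents]
    return [r for rules in wild for r in rules], True


def can_fetch(rules, path):
    """Longest matching rule wins; Allow wins a tie; no match means allowed."""
    best_len, allowed = -1, True
    for kind, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue
        is_allow = kind == "allow"
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len, allowed = len(prefix), is_allow
    return allowed


groups = parse_groups(ROBOTS_TXT)
for agent in ("GPTBot", "CCBot"):
    rules, inherited = rules_for(groups, agent)
    source = "inherited" if inherited else "explicit"
    print(agent, source, can_fetch(rules, "/"), can_fetch(rules, "/tools/demo"))
# GPTBot explicit True False
# CCBot inherited True False
```

The matcher is written by hand because Python's `urllib.robotparser` applies the first matching rule in file order. With `Allow: /` listed before `Disallow: /tools/`, it would report `/tools/` as fetchable, which differs from the longest-match behavior assumed here.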

Current robots.txt 407 bytes · sha256 8736a6736453 · raw

User-agent: *
Allow: /
Disallow: /tools/

User-agent: GPTBot
User-agent: ChatGPT-User
User-agent: PerplexityBot
User-agent: ClaudeBot
User-agent: Google-Extended
User-agent: Applebot-Extended
Allow: /
Disallow: /tools/

Sitemap: https://x.ai/sitemap.xml

# Content Signals (draft) — declare AI/search usage preferences
# See: https://contentsignals.org/
Content-Signal: ai-train=no, search=yes, ai-input=no
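
The Content-Signal line above is a comma-separated list of key=value preferences. A minimal sketch of reading it into a dictionary follows. The parsing rules are inferred only from the single line shown in this snapshot, since Content Signals is marked as a draft here. The helper name `parse_content_signal` is hypothetical, and treating any value other than "yes" as false is an assumption.

```python
def parse_content_signal(line):
    """Split 'Content-Signal: k=v, k=v' into a dict of booleans."""
    name, _, value = line.partition(":")
    if name.strip().lower() != "content-signal":
        raise ValueError("not a Content-Signal line")
    signals = {}
    for item in value.split(","):
        key, sep, val = item.strip().partition("=")
        if sep:  # skip fragments without '='
            # Assumption: only "yes" counts as permission granted.
            signals[key.strip().lower()] = val.strip().lower() == "yes"
    return signals


signals = parse_content_signal(
    "Content-Signal: ai-train=no, search=yes, ai-input=no"
)
print(signals)
# {'ai-train': False, 'search': True, 'ai-input': False}
```

For this snapshot, the result reads as: search use is permitted, while AI training and AI input are not. The authoritative meaning of each key is defined at https://contentsignals.org/.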

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived