NeuralCrawl

๐Ÿ‡ฌ๐Ÿ‡ง Majestic

majestic.com · SEO & AI search · rank #37 · SEO intelligence · live robots.txt ↗

AI crawler access (latest snapshot, 5h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 489 bytes · sha256 368886dcd808 · raw

#robots.txt Rev #9 20230620

User-agent: *
Disallow: /business/
Disallow: /forbidwarn
Disallow: /infectwarn


User-agent: MJ12bot
Disallow: /forbidwarn
Disallow: /infectwarn
Allow: /business/

User-agent: Googlebot
Disallow: /forbidwarn
Disallow: /infectwarn
Allow: /business/

User-agent: bingbot
Disallow: /forbidwarn
Disallow: /infectwarn
Allow: /business/

# banned bots.
User-agent: Vegi bot
User-agent: seostats
User-agent: MauiBot
User-agent: TechSEO360
Crawl-delay: 20
Disallow: /

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived