NeuralCrawl

πŸ‡ΊπŸ‡Έ BrightEdge

brightedge.com · SEO & AI search · rank #12 · SEO software · live robots.txt ↗

AI crawler access (latest snapshot, 4h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 2512 bytes · sha256 51c4292d739e · raw

# =================================================
# robots.txt
# =================================================

User-agent: *

# -------------------------------------------------
# ALLOW rendering assets
# -------------------------------------------------
Allow: /core/*.css
Allow: /core/*.js
Allow: /core/*.gif
Allow: /core/*.jpg
Allow: /core/*.jpeg
Allow: /core/*.png
Allow: /core/*.svg

Allow: /profiles/*.css
Allow: /profiles/*.js
Allow: /profiles/*.gif
Allow: /profiles/*.jpg
Allow: /profiles/*.jpeg
Allow: /profiles/*.png
Allow: /profiles/*.svg

Allow: /themes/*.css
Allow: /themes/*.js

# -------------------------------------------------
# BLOCK admin & authentication
# -------------------------------------------------
Disallow: /admin/
Disallow: /user/login
Disallow: /user/logout
Disallow: /user/register
Disallow: /user/password

# -------------------------------------------------
# BLOCK content creation & system actions
# -------------------------------------------------
Disallow: /node/add/
Disallow: /comment/reply/
Disallow: /filter/tips
Disallow: /media/oembed
Disallow: /*/media/oembed

# -------------------------------------------------
# BLOCK internal search & result pages
# -------------------------------------------------
Disallow: /search/
Disallow: /index.php/search/

# -------------------------------------------------
# BLOCK non-clean URL duplicates
# -------------------------------------------------
Disallow: /index.php/admin/
Disallow: /index.php/comment/reply/
Disallow: /index.php/filter/tips
Disallow: /index.php/node/add/
Disallow: /index.php/user/login
Disallow: /index.php/user/logout
Disallow: /index.php/user/register
Disallow: /index.php/user/password
Disallow: /index.php/media/oembed
Disallow: /index.php/*/media/oembed

# -------------------------------------------------
# BLOCK documentation & internal files
# -------------------------------------------------
Disallow: /README.md
Disallow: /composer/
Disallow: /modules/README.txt
Disallow: /sites/README.txt
Disallow: /themes/README.txt

# -------------------------------------------------
# BLOCK common exploit & scan paths
# -------------------------------------------------
Disallow: /vendor/
Disallow: /.env
Disallow: /.git
Disallow: /.svn
Disallow: /.hg
Disallow: /backup/
Disallow: /backups/
Disallow: /tmp/
Disallow: /private/

# -------------------------------------------------
# SITEMAP
# -------------------------------------------------
Sitemap: https://www.brightedge.com/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived