NeuralCrawl

πŸ‡ΊπŸ‡Έ SentinelOne

sentinelone.com · Cybersecurity · rank #7 · Cybersecurity · live robots.txt ↗

AI crawler access (latest snapshot, 5h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 814 bytes · sha256 beb5d1a712e5 · raw

User-agent: *
Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
Disallow: /*?*
Allow: /resources/?*
Disallow: *.pdf$
Disallow: /static/lp/s1-value-app-2024/


# Google News-specific exclusion
User-agent: Googlebot-News
Disallow: /
Allow: /press/
Allow: /blog/


# Sitemap Indexes
Sitemap: https://www.sentinelone.com/sitemap_index.xml
Sitemap: https://www.sentinelone.com/nl/sitemap_index.xml
Sitemap: https://www.sentinelone.com/de/sitemap_index.xml
Sitemap: https://www.sentinelone.com/es/sitemap_index.xml
Sitemap: https://www.sentinelone.com/it/sitemap_index.xml
Sitemap: https://www.sentinelone.com/fr/sitemap_index.xml
Sitemap: https://www.sentinelone.com/ja/sitemap_index.xml
Sitemap: https://www.sentinelone.com/ko/sitemap_index.xml

# CVE DB
Sitemap: https://www.sentinelone.com/cve-sitemaps/index.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived