NeuralCrawl

๐Ÿ‡ฌ๐Ÿ‡ง Sophos

sophos.com · Cybersecurity · rank #39 · Cybersecurity · live robots.txt ↗

AI crawler access (latest snapshot, 5h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1720 bytes · sha256 6632a56ec3c4 · raw

# robots.txt for www.sophos.com (generated November 2025)

# Disallow internal or non-public endpoints (update these for content stack)
#User-agent: *
#Disallow: /admin/
#Disallow: /internal/
#Disallow: /private/
#Disallow: /error/

# Allow all other content, including documentation, downloads and product pages.
Allow: /

#  Allow the LLM policy file
Allow: /llms.txt

# Sitemaps for supported locales
Sitemap: https://www.sophos.com/en-us/sitemap.xml
Sitemap: https://www.sophos.com/en-gb/sitemap.xml
Sitemap: https://www.sophos.com/de-de/sitemap.xml
Sitemap: https://www.sophos.com/es-es/sitemap.xml
Sitemap: https://www.sophos.com/it-it/sitemap.xml
Sitemap: https://www.sophos.com/fr-fr/sitemap.xml
Sitemap: https://www.sophos.com/ja-jp/sitemap.xml
Sitemap: https://www.sophos.com/pt-br/sitemap.xml
Sitemap: https://www.sophos.com/zh-cn/sitemap.xml

# video sitemap
Sitemap: https://www.sophos.com/video-sitemap.xml


# Allow AI Search and Agent Crawlers

User-agent: Anthropic-ai
Allow: /

User-agent: ClaudeBot
Allow: /

User-agent: Bingbot
Allow: /

User-agent: BingAI
Allow: /

User-agent: BingPreview
Allow: /

User-agent: ChatGPT-User
Allow: /

User-agent: CopilotBot
Allow: /

User-agent: Googlebot
Allow: /

User-agent: Google-Extended
Allow: /

User-agent: GPTBot
Allow: /

User-agent: LlamaBot
Allow: /

User-agent: OAI-SearchBot
Allow: /

User-agent: PerplexityBot
Allow: /

User-agent: xAI-Bot
Allow: /

User-agent: Amazonbot
Allow: /

User-agent: FirecrawlAgent 
Allow: / 

User-agent: AndiBot 
Allow: / 

User-agent: ExaBot 
Allow: / 

User-agent: PhindBot 
Allow: / 

User-agent: YouBot 
Allow: /

User-agent: PetalBot
Disallow: /

# Disallow specific LLM crawler
User-agent: DeepSeekBot
Disallow: /

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived