NeuralCrawl

πŸ‡ΊπŸ‡Έ ExtraHop

extrahop.com · Cybersecurity · rank #56 · Cybersecurity · live robots.txt ↗

AI crawler access (latest snapshot, 5h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 481 bytes · sha256 85b71efe4f12 · raw

# robots.txt
User-agent: *                   #Applies to all bots
Disallow: /*?*                  #Blocks parameters from being crawled

#Sanity
Disallow: /studio

#Allow image resources
Allow: /_next/image?url=*

#Specific pages
Disallow: /demo-success
Disallow: /contact-success
Disallow: /partners/locator

#Legal pages
Disallow: /legal/*

# Ahrefs crawlers
User-agent: AhrefsSiteAudit
Allow: /

User-agent: AhrefsBot
Allow: /

Sitemap: https://www.extrahop.com/api/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived