NeuralCrawl

πŸ‡ΊπŸ‡Έ Vectra AI

vectra.ai · Cybersecurity · rank #55 · Cybersecurity · live robots.txt ↗

AI crawler access (latest snapshot, 5h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1013 bytes · sha256 866fc812cdbf · raw

User-agent: *
Disallow: /*?utm_source=
Disallow: /*?utm_medium=
Disallow: /*?utm_campaign=
Disallow: /*?utm_content=
Disallow: /*?utm_term=
Disallow: /*?utm_id=
Disallow: /*?_hsenc=
Disallow: /*?_hssc=
Disallow: /*?_hstc=
Disallow: /*?__hstc=
Disallow: /*?__hssc=
Disallow: /*?__hsfp=
Disallow: /*?trk=
Disallow: /*?ref=
Disallow: /*?Filter=
Disallow: /*?language-e3zl=
Disallow: /*?language-ed88=
Disallow: /*?a_aid=
Disallow: /*?affiliate_id=
Disallow: /*?cd8e8676_page=
Disallow: /*?984b3f83_page=
Disallow: /*?5b46b72a_page=
Disallow: /*?48f7950e_page=
Disallow: /*?gh_src=
Disallow: /*?gh_jid=
Disallow: /*?data1=
Disallow: /*?fr=
Disallow: /*?via=
Disallow: /*?top11=
Disallow: /*?text=
Disallow: /*?dgc=
Disallow: /*&ref=
Disallow: /*&_ga=
Disallow: /*&_ga=

User-agent: cache_warmer (+https://www.cache-warmer.com)
Disallow: /

User-agent: MJ12bot
Disallow: /

User-agent: IsDownBot
Disallow: /

# Allow search engines to access sitemap
Allow: /sitemap.xml

Sitemap: https://www.vectra.ai/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived