NeuralCrawl

🇳🇱 Startpage

startpage.com · SEO & AI search · rank #52 · Search engine

AI crawler access (latest snapshot, 4h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 188 bytes · sha256 795bee4d79cf

User-agent: *
Allow: /sp/cdn/images/
Allow: /sp/cdn/favicons/
Disallow: /cgi-bin/
Disallow: /do/
Disallow: /sp/
Disallow: /av/
Noindex: /cgi-bin/
Noindex: /do/
Noindex: /sp/
Noindex: /av/
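
The file above has a single `User-agent: *` group, so every crawler listed earlier falls back to the wildcard rules. A minimal sketch of how those rules might be evaluated, using Python's standard `urllib.robotparser`; the sample paths and the `access_report` helper are hypothetical and chosen only for illustration. Python's parser applies the first matching rule, while RFC 9309 specifies the most specific match; for this file both approaches should agree, since the longer `Allow` paths are listed first. `Noindex` is not part of RFC 9309, and this parser skips lines it does not recognize.

```python
from urllib.robotparser import RobotFileParser

# The archived robots.txt, copied from the snapshot above.
ROBOTS_TXT = """\
User-agent: *
Allow: /sp/cdn/images/
Allow: /sp/cdn/favicons/
Disallow: /cgi-bin/
Disallow: /do/
Disallow: /sp/
Disallow: /av/
Noindex: /cgi-bin/
Noindex: /do/
Noindex: /sp/
Noindex: /av/
"""

# Hypothetical sample paths; they are not taken from the snapshot.
SAMPLE_PATHS = ["/", "/sp/search", "/sp/cdn/images/logo.png", "/do/search"]


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)


def access_report(agent: str) -> dict[str, bool]:
    """Map each sample path to whether `agent` may fetch it."""
    return {path: parser.can_fetch(agent, path) for path in SAMPLE_PATHS}


# No group names these agents, so both inherit the * wildcard rules.
for agent in ("GPTBot", "ClaudeBot"):
    print(agent, access_report(agent))
```

Under these rules a crawler may fetch most of the site but not `/cgi-bin/`, `/do/`, `/sp/`, or `/av/`, except for the two `/sp/cdn/` subpaths that are explicitly allowed.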

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived