NeuralCrawl

πŸ‡ΊπŸ‡Έ ORCID

orcid.org · Academic & open research · rank #19 · Researcher identifiers · live robots.txt ↗

AI crawler access (latest snapshot, 3h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 551 bytes · sha256 bdf122a71251 · raw

User-agent: *
Disallow: /2FA/setup
Disallow: /account
Disallow: /css/index.php
Disallow: /developer-tools
Disallow: /http*
Disallow: /https*
Disallow: /inbox
Disallow: /institutional-signin
Disallow: /my-orcid
Disallow: /oauth/authorize*
Disallow: /oauth/signin
Disallow: /orcid-search/
Disallow: /orgs/disambiguated/
Disallow: /oauth
Disallow: /qr-code.png
Disallow: /reactivation
Disallow: /register
Disallow: /repeater.php
Disallow: /social-linking
Disallow: /trusted-parties
Disallow: /wp-admin/
Disallow: /wp-includes/wp-class.php
Crawl-delay: 1

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived