NeuralCrawl

πŸ‡ΊπŸ‡Έ OneTrust

onetrust.com · Cybersecurity · rank #30 · Cybersecurity · live robots.txt ↗

AI crawler access (latest snapshot, 5h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 397 bytes · sha256 ee6cff5e0fce · raw

Sitemap: https://www.onetrust.com/sitemap.xml

User-agent: *
Disallow: /terms/
Disallow: /support/
Disallow: /signature/
Disallow: /product-documentation/
Disallow: /pdf/
Disallow: /paypal/
Disallow: /operations/
Disallow: /hr/
Disallow: /email/
Disallow: /dataguidance/
Disallow: /client-logos/
Disallow: /assets/
Disallow: /build/
Disallow: /sales-demo-video/
Disallow: /webex/
Disallow: /*.xls

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived