NeuralCrawl

πŸ‡ΊπŸ‡Έ Perplexity

perplexity.ai · Top 1000 websites · rank #8 · AI Chatbots and Tools · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1218 bytes · sha256 6fe555549ff6 · raw

# https://www.robotstxt.org/robotstxt.html
User-agent: *
Disallow: /*?*q=
Disallow: /search/new
Disallow: /search?*/
Disallow: /socket.io/
Disallow: /onboarding/
Disallow: /join/
Allow: /academic
Allow: /api-platform
Allow: /assistant
Allow: /comet
Allow: /comet/gettingstarted
Allow: /comet/resources
Allow: /enterprise
Allow: /finance
Allow: /hub
Allow: /hub/careers
Allow: /hub/careers/interview-guide
Allow: /hub/deep-research
Allow: /hub/products/app-connectors/snowflake
Allow: /hub/products/computer-for-windows
Allow: /hub/products/integrations/microsoft
Allow: /pro
User-agent: Googlebot
Disallow: /*?*q=
Disallow: /search*
Disallow: /search?*/
Disallow: /socket.io/
Disallow: /onboarding/
Disallow: /join/
Allow: /academic
Allow: /api-platform
Allow: /assistant
Allow: /comet
Allow: /comet/gettingstarted
Allow: /comet/resources
Allow: /enterprise
Allow: /finance
Allow: /hub
Allow: /hub/careers
Allow: /hub/careers/interview-guide
Allow: /hub/deep-research
Allow: /hub/products/app-connectors/snowflake
Allow: /hub/products/computer-for-windows
Allow: /hub/products/integrations/microsoft
Allow: /pro
Sitemap: https://www.perplexity.ai/sitemap.xml
Sitemap: https://www.perplexity.ai/help-center/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived