NeuralCrawl

πŸ‡ΊπŸ‡Έ Pinecone

pinecone.io · Top 1000 websites · rank #35 · AI Chatbots and Tools · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 357 bytes · sha256 819a97f513d1 · raw

# *
User-agent: *
Disallow: /admin/*
Disallow: /partners/referral-agreement.pdf
Disallow: /partners/affiliate-agreement.pdf
Disallow: /pinecone-brand.pdf

Disallow: /lp/pinecone-vector-database
Disallow: /lp/pinecone-vector-database-enterprise

Allow: /api/og/*

# Host
Host: https://www.pinecone.io

# Sitemaps
Sitemap: https://www.pinecone.io/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived