NeuralCrawl

๐Ÿ‡ซ๐Ÿ‡ท Qwant

qwant.com · SEO & AI search · rank #51 · Search engine · live robots.txt ↗

AI crawler access (latest snapshot, 4h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 2280 bytes · sha256 d768f6d48418 · raw

#Robots.txt - www.qwant.com

User-agent: *
Disallow: /?*q=*
Disallow: /account

#AI Bots
User-agent: AI2Bot
User-agent: AI2Bot-DeepResearchEval
User-agent: Ai2Bot-Dolma
User-agent: Amazonbot
User-agent: AmazonBuyForMe
User-agent: anthropic-ai
User-agent: Applebot-Extended
User-agent: bigsur.ai
User-agent: Bytespider
User-agent: CCBot
User-agent: ChatGLM
User-agent: ChatGLM-Spider
User-agent: ChatGPT-Operator
User-agent: ChatGPT-User
User-agent: ClaudeBot
User-agent: Claude-User
User-agent: Claude-Web
User-agent: CloudVertexBot
User-agent: cohere-ai
User-agent: cohere-training-data-crawler
User-agent: Cotoyogi
User-agent: DataForSeoBot
User-agent: Datenbank Crawler
User-agent: Devin
User-agent: Diffbot
User-agent: DuckAssistBot
User-agent: FacebookBot
User-Agent: FriendlyCrawler
User-agent: gemini-deep-research
User-agent: GoogleAgent-Mariner
User-agent: Google-Extended
User-agent: Google-NotebookLM
User-agent: GPTBot
User-agent: ICC-Crawler
User-agent: ImagesiftBot
User-agent: imageSpider
User-agent: img2dataset
User-agent: Kangaroo Bot
User-agent: KlaviyoAIBot
User-agent: laion-huggingface-processor
User-agent: LCC
User-agent: LinerBot
User-agent: Manus-User
User-agent: meta-externalagent
User-agent: meta-externalads
User-agent: meta-externalfetcher
User-agent: meta-webindexer
User-agent: MistralAI-User
User-agent: netEstate
User-agent: netEstate Imprint Crawler
User-agent: NovaAct
User-agent: omgili
User-agent: OmigiliBot
User-agent: PanguBot
User-agent: Perplexity-User
User-agent: PhindBot
User-agent: Poggio-Citations
User-agent: QualifiedBot
User-agent: SBIntuitionsBot
User-Agent: SemrushBot-SWA
User-agent: sider.ai
User-agent: Spider
User-agent: TavilyBot
User-agent: TheKnowledgeAI
User-agent: TimpiBot
User-agent: TwinAgent
User-agent: VelenPublicWebCrawler
User-agent: webzio-extended
User-agent: YouBot
Disallow: /?*q=*
Disallow: /?drawer=*

#Bots
User-agent: AhrefsBot
User-agent: SearchmetricsBot
User-agent: rogerbot
User-agent: deepcrawl
User-Agent: OnCrawl
User-agent: Screaming Frog SEO Spider
User-agent: MJ12bot
User-Agent: Sistrix
User-agent: barkrowler
Disallow: /

#Ads Bots
User-agent: Mediapartners-Google
User-agent: Google-Display-Ads-Bot
Disallow:

#Social Bots
User-agent: Twitterbot
User-agent: facebookexternalhit
Disallow:

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived