NeuralCrawl

🇳🇱 Ahold Delhaize

aholddelhaize.com · European companies · rank #108 · Retail

AI crawler access (latest snapshot, 13h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot
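
The robots.txt archived on this page declares only a `User-Agent: *` group, so every crawler listed above falls back to that one rule set. A minimal Python sketch of that fallback follows. It assumes exact, case-insensitive matching of the crawler name against declared user-agent tokens, which simplifies RFC 9309. The function `select_group` and the `HYPOTHETICAL` file are illustrative names, not part of NeuralCrawl or of the archived site.

```python
# The first lines of the archived file: a single wildcard group.
ARCHIVED = """\
User-Agent: *
Disallow:
Allow: /
"""

# Hypothetical file, for contrast only: GPTBot gets a group of its own.
HYPOTHETICAL = """\
User-agent: GPTBot
Disallow: /

User-agent: *
Allow: /
"""


def select_group(robots_txt, crawler):
    """Return the user-agent token of the group that applies to `crawler`.

    Returns the crawler's own token (lowercased) when a dedicated group
    exists, "*" when only the wildcard group applies, and None when the
    file declares neither.
    """
    declared = set()
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()      # drop comments
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "user-agent":
            declared.add(value.strip().lower())
    name = crawler.lower()
    if name in declared:
        return name
    return "*" if "*" in declared else None


for bot in ("GPTBot", "ClaudeBot", "PerplexityBot"):
    print(bot, "->", select_group(ARCHIVED, bot))
```

With the archived file, each of the three names resolves to `*`. With the hypothetical file, only `GPTBot` resolves to its own group.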

Current robots.txt · 368 bytes · sha256 09bbc1bb4ea6

User-Agent: *
Disallow:
Allow: /

# Block admin and technical paths
Disallow: /umbraco/
Disallow: /_nuxt/
Disallow: /api/
Disallow: /admin/
Disallow: /private/
Disallow: /*.json$
Disallow: /*.xml$
Disallow: /node_modules/
Disallow: /health
Disallow: /server/
Disallow: /uSync/

# TODO replace with actual sitemap url
Sitemap: https://www.aholddelhaize.com/sitemap.xml
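
To see how these rules combine for a given path, here is a small Python sketch of the longest-match evaluation described in RFC 9309: the matching rule with the longest pattern wins, `Allow` wins a tie, `*` matches any run of characters, and a trailing `$` anchors the end of the path. It assumes every rule belongs to the single `*` group and that blank lines do not end a group. The helper names (`parse_rules`, `pattern_to_regex`, `is_allowed`) are illustrative and not taken from any library.

```python
import re

ROBOTS_TXT = """\
User-Agent: *
Disallow:
Allow: /

# Block admin and technical paths
Disallow: /umbraco/
Disallow: /_nuxt/
Disallow: /api/
Disallow: /admin/
Disallow: /private/
Disallow: /*.json$
Disallow: /*.xml$
Disallow: /node_modules/
Disallow: /health
Disallow: /server/
Disallow: /uSync/

# TODO replace with actual sitemap url
Sitemap: https://www.aholddelhaize.com/sitemap.xml
"""


def parse_rules(text):
    """Collect (is_allow, pattern) pairs; assumes one '*' group only."""
    rules = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()      # drop comments
        if ":" not in line:
            continue
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()
        # An empty pattern (the bare "Disallow:") matches nothing, so skip it.
        if field in ("allow", "disallow") and value:
            rules.append((field == "allow", value))
    return rules


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path, rules):
    """Longest matching pattern wins; Allow wins ties; default is allow."""
    best_len, best_allow = -1, True
    for allow, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and allow):
                best_len, best_allow = length, allow
    return best_allow


RULES = parse_rules(ROBOTS_TXT)
for path in ("/", "/about-us", "/api/stores", "/data/feed.json", "/sitemap.xml"):
    print(path, "allowed" if is_allowed(path, RULES) else "disallowed")
```

Under these assumptions, `/` and `/about-us` are allowed, while `/api/stores` and `/data/feed.json` are disallowed. The path `/sitemap.xml` also evaluates as disallowed, because it matches `Disallow: /*.xml$`, which is longer than `Allow: /`.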

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived
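
A change history like this one can be built by comparing content fingerprints of successive snapshots. The Python sketch below assumes that the 12-character value shown next to the byte count is a truncated SHA-256 hex digest of the raw file. That is a guess about how NeuralCrawl works, not something this page confirms. The names `fingerprint` and `has_changed` are illustrative.

```python
import hashlib


def fingerprint(body: bytes, prefix_len: int = 12):
    """Return (size in bytes, first `prefix_len` hex chars of the SHA-256)."""
    digest = hashlib.sha256(body).hexdigest()
    return len(body), digest[:prefix_len]


def has_changed(previous: bytes, current: bytes) -> bool:
    """Compare full digests, so a collision in the short prefix cannot hide a change."""
    return hashlib.sha256(previous).digest() != hashlib.sha256(current).digest()


old_snapshot = b"User-Agent: *\nAllow: /\n"
new_snapshot = b"User-Agent: *\nDisallow: /\n"
print(fingerprint(old_snapshot))
print(has_changed(old_snapshot, new_snapshot))
```

Only the short prefix is meant for display. The comparison uses the full digest.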