NeuralCrawl

๐Ÿ‡ฉ๐Ÿ‡ช Evonik

evonik.com · European companies · rank #144 · Chemicals · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1201 bytes · sha256 2ff9f4fc7b1f · raw

# Start Group: crawler / LLMO
User-Agent: Bingbot
User-Agent: GPTBot
User-Agent: ClaudeBot
User-Agent: Claude-User
User-Agent: Claude-SearchBot
User-Agent: CCBot
User-Agent: Google-Extended
User-Agent: Applebot-Extended
User-Agent: Facebookbot
User-Agent: Meta-ExternalAgent
User-Agent: Meta-ExternalFetcher
User-Agent: diffbot
User-Agent: PerplexityBot
User-Agent: TikTokSpider
User-Agent: Amazonbot
User-Agent: OAI-SearchBot
User-Agent: Google-CloudVertexBot
User-Agent: DuckAssistBot
User-Agent: GoogleAgent-Mariner
User-Agent: Gemini-Deep-Research
User-Agent: Google-NotebookLM
User-Agent: Google-Agent
User-Agent: GoogleAgent-URLContext
User-Agent: Google-Firebase
User-Agent: MistralAI-User
User-Agent: Spacecat/1.0
Disallow: /en/my-evonik/
Disallow: /en/incident/
Disallow: /de/incident/
Disallow: /en/error/
Allow: /
# End Group: crawler / LLMO
# Start Group: known Spam-Bots
User-agent: MJ12bot
Disallow: /
# End Group: known Spam-Bots
# Start Group: all others
User-agent: *
Disallow: /en/my-evonik/
Disallow: /en/incident/
Disallow: /de/incident/
Disallow: /en/error/
Allow: /
# End Group: all

Sitemap: https://www.evonik.com/en.sitemap.xml 
Sitemap: https://www.evonik.com/de.sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived