NeuralCrawl

๐Ÿ‡ฏ๐Ÿ‡ต Keyence

keyence.com · Top 1000 websites · rank #845 · Industrials · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1411 bytes · sha256 851d50591706 · raw

User-agent: Mozilla/4.5 (compatible; Search/crawler;)
Crawl-Delay: 1

Disallow: /doptinUlExpired.jsp$
Disallow: /error.jsp$
Disallow: /autoSignUp/
Disallow: /activation/
Disallow: /ajax/
Disallow: /common/
Disallow: /dlfile/
Disallow: /download/directDownload/
Disallow: /download/downloadCart/
Disallow: /enquete/
Disallow: /include/
Disallow: /php/
Disallow: /pv/
Disallow: /simpleInquiry/
Disallow: /template/
Disallow: /Images/products_nav_*.png
Disallow: /cscz/Images/products_nav_*.png
Disallow: /dede/Images/products_nav_*.png
Disallow: /frfr/Images/products_nav_*.png
Disallow: /huhu/Images/products_nav_*.png
Disallow: /nlnl/Images/products_nav_*.png
Disallow: /plpl/Images/products_nav_*.png

User-agent: *
Crawl-Delay: 5

Disallow: /doptinUlExpired.jsp$
Disallow: /error.jsp$
Disallow: /autoSignUp/
Disallow: /activation/
Disallow: /ajax/
Disallow: /common/
Disallow: /dlfile/
Disallow: /download/directDownload/
Disallow: /download/downloadCart/
Disallow: /enquete/
Disallow: /include/
Disallow: /php/
Disallow: /pv/
Disallow: /simpleInquiry/
Disallow: /template/
Disallow: /Images/products_nav_*.png
Disallow: /cscz/Images/products_nav_*.png
Disallow: /dede/Images/products_nav_*.png
Disallow: /frfr/Images/products_nav_*.png
Disallow: /huhu/Images/products_nav_*.png
Disallow: /nlnl/Images/products_nav_*.png
Disallow: /plpl/Images/products_nav_*.png

Sitemap: https://www.keyence.com/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived