NeuralCrawl

🇲🇽 Cemex

cemex.com · National indices · rank #87 · Materials

AI crawler access (latest snapshot, 13h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt · 537 bytes · sha256 54ff0878e824

# this is the robots.txt file for cemex.com
# este es el archivo robots.txt de cemex.com

User-Agent: *

Disallow: /web/
Disallow: /asset-publisher/
Disallow: /-/asset_publisher/
Disallow: /view/-/
Disallow: /zh/
Disallow: /cz/
Disallow: /ja/
Disallow: /nl/
Disallow: /fr/
Disallow: /pt/
Disallow: /de/
Disallow: /c/
Disallow: /es-MX/
Disallow: /en-US/

Sitemap: https://www.cemex.com/sitemap.xml

# see our latest career opportunities at https://jobs.cemex.com/
# vea nuestras últimas oportunidades laborales en https://jobs.cemex.com/
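
The file above defines a single `User-Agent: *` group, so every crawler in the list falls back to the same rules. The following is a minimal sketch, using Python's standard `urllib.robotparser`, of how that fallback could be checked. The helper names (`build_parser`, `allowed`) and the sample path `/en-US/products` are hypothetical, chosen only for illustration, and the standard-library parser may match paths differently from any particular crawler or from this page's own classification.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the snapshot above (comment lines omitted).
ROBOTS_TXT = """\
User-Agent: *

Disallow: /web/
Disallow: /asset-publisher/
Disallow: /-/asset_publisher/
Disallow: /view/-/
Disallow: /zh/
Disallow: /cz/
Disallow: /ja/
Disallow: /nl/
Disallow: /fr/
Disallow: /pt/
Disallow: /de/
Disallow: /c/
Disallow: /es-MX/
Disallow: /en-US/

Sitemap: https://www.cemex.com/sitemap.xml
"""

# A few of the crawlers listed above; none has its own group in the file.
AGENTS = ["GPTBot", "ClaudeBot", "CCBot", "PerplexityBot"]


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text into a RobotFileParser (hypothetical helper)."""
    # Blank lines are dropped first: Python's parser can treat a blank line
    # directly after "User-Agent:" as the end of the group, which would
    # cause the Disallow rules that follow to be ignored.
    lines = [ln for ln in text.splitlines() if ln.strip()]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def allowed(parser: RobotFileParser, agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` on www.cemex.com."""
    return parser.can_fetch(agent, "https://www.cemex.com" + path)


if __name__ == "__main__":
    parser = build_parser(ROBOTS_TXT)
    for agent in AGENTS:
        # "/en-US/products" is an illustrative path, not taken from the site.
        print(agent, allowed(parser, agent, "/"), allowed(parser, agent, "/en-US/products"))
    print(parser.site_maps())
```

Because no agent-specific group exists, each agent gets the same answer for a given path: the wildcard rules leave the site root open while closing the listed path prefixes.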

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived