NeuralCrawl

๐Ÿ‡ฌ๐Ÿ‡ง Rio Tinto

riotinto.com · Top 1000 websites · rank #835 · Materials · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 463 bytes · sha256 a236268e27d3 · raw

User-agent: *
Disallow: /404
Disallow: /jp/
Disallow: /kr/
Disallow: /CMS/
Disallow: /search/
Disallow: /Home/404
Disallow: /sitecore/
Disallow: /-/media/Base-Themes

Sitemap: https://www.riotinto.com/sitemap.xml
Sitemap: https://www.riotinto.com/mn/sitemap.xml
Sitemap: https://www.riotinto.com/jp/sitemap.xml
Sitemap: https://www.riotinto.com/kr/sitemap.xml
Sitemap: https://www.riotinto.com/can/sitemap.xml
Sitemap: https://www.riotinto.com/master/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived