NeuralCrawl

πŸ‡ΏπŸ‡¦ Anglo American

angloamerican.com · National indices · rank #89 · Materials · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1481 bytes · sha256 bf072f48545f · raw

User-agent: *
Allow: / 
Disallow: /*ics?
Disallow: /WebResource
Disallow: /*?page
Disallow: /*?product=
Disallow: /*?async=
Disallow: /*?sc_lang=en
Disallow: /sitecore/ 
Disallow: /error-page.aspx 
Disallow: /site-services/where-we-operate-map-new
Disallow: /site-services/where-we-operate-map-new.aspx
Disallow: /our-stories/leadership/a-look-back-on-mark-cutifanis-time-at-anglo-american
Disallow: /our-stories/leadership/a-look-back-on-mark-cutifanis-time-at-anglo-american.aspx
Disallow: /our-stories/leadership/welcoming-our-new-chief-executive-mark-cutifani
Disallow: /our-stories/leadership/welcoming-our-new-chief-executive-mark-cutifani.aspx
Disallow: /~/media/Files/A/Anglo-American-PLC-V2/*
Disallow: /~/media/Files/A/Anglo-American-Group-v9/PLC/documents/chs-inclusion-and-accessibility.pdf
Disallow: /~/media/Images/A/Anglo-American-Group-v9/PLC/media/image-gallery/images/mogalakwena-pit-view-image.jpg
Disallow: /~/media/Images/A/Anglo-American-Group-v9/PLC/media/image-gallery/images/ebp-2804-image.jpg
Disallow: /~/media/Files/A/Anglo-American-Group-v5/PLC/sustainability/anglo-american-responsible-commodity-sourcing-policy-for-marketing.pdf
Disallow: /chs-inclusion-and-accessibility
Disallow: /site-services/country-map
Disallow: /site-services/materiality-matrix-new

Sitemap: https://www.angloamerican.com/sitemap.xml
Sitemap: https://www.angloamerican.com/site-services/search-and-apply-data-fetch.xml?aadata=get-job-sitemap-xml&content-type=application/xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived