NeuralCrawl

๐Ÿ‡ฎ๐Ÿ‡ช Accenture

accenture.com · European companies · rank #119 · Technology · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 807 bytes · sha256 5678c8417105 · raw

User-agent: *
Disallow: */Careers/Registration
Disallow: */Careers/Form
Disallow: */Careers/Profiles
Disallow: */error/
Disallow: */loginpage
Disallow: */sitecore/
Disallow: */BucketContent
Disallow: */RedesignBucket
Disallow: */core/
Disallow: */?sc_lang
Disallow: */secure/
Disallow: */clients/
Disallow: */client/
Disallow: */a-com-no-follow-no-index/
Disallow: */_acnmedia/
Disallow: */_globalreferences/
Disallow: /us-en/careers/admin/careersxmlresult
Disallow: /*/header/global-header/
Disallow: /*.nocache.html
Disallow: */content/acom/
Disallow: /*tlaAppCB
Disallow: */careers/jobsearch?
Disallow: */search/ai-search
Disallow: */search/results
Disallow: /*WHB
Disallow: /*Channel
Disallow: /*channel 
Disallow: /*target_id
Disallow: /*utm_source
sitemap: https://www.accenture.com/sitemap-index.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived