NeuralCrawl

๐Ÿ‡ฌ๐Ÿ‡ง The Independent

independent.co.uk · Publishers · rank #41 · News · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1304 bytes · sha256 730ccddf0996 · raw

User-agent: *

# Files
Disallow: /CHANGELOG.txt
Disallow: /cron.php
Disallow: /_fs-ch-
Disallow: /INSTALL.mysql.txt
Disallow: /INSTALL.pgsql.txt
Disallow: /INSTALL.sqlite.txt
Disallow: /install.php
Disallow: /INSTALL.txt
Disallow: /LICENSE.txt
Disallow: /MAINTAINERS.txt
Disallow: /update.php
Disallow: /UPGRADE.txt
Disallow: /xmlrpc.php
Disallow: /img/placeholder.gif
# Paths (clean URLs)
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /filter/tips/
Disallow: /node/add/
Disallow: /search/
Disallow: /user/register/
Disallow: /user/password/
Disallow: /user/login/
Disallow: /user/logout/
Disallow: /internal-api/
# Paths (no clean URLs)
Disallow: /?q=admin/
Disallow: /?q=comment/reply/
Disallow: /?q=filter/tips/
Disallow: /?q=node/add/
Disallow: /?q=search/
Disallow: /?q=user/password/
Disallow: /?q=user/register/
Disallow: /?q=user/login/
Disallow: /?q=user/logout/
Disallow: /pugpig/
Disallow: /videorequest
Disallow: /api/
Disallow: /api/html/
Disallow: /api/user/
Disallow: /71347885/
Disallow: /cb

# Ignore liveblog pagination and swipe tracking
Disallow: *itm_channel=native
Disallow: *?page=

# Ignore refresh URLs
Disallow: /*ILC-refresh

User-agent: Nutch
Disallow: /

Sitemap: https://www.independent.co.uk/sitemaps/googlenews
Sitemap: https://www.independent.co.uk/sitemap.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived

Editorial profile Content bias, reliability & geopolitical trust

Content biasLean Left CredibilityHigh reliability

Content bias (political lean) and credibility (factual track record) are third-party/aggregated assessments โ€” source: AllSides/MBFC โ€” and may be contested.