NeuralCrawl

The Independent / robots.txt snapshot

← back to independent.co.uk · fetched 2026-06-20T01:10:30Z (15h ago) · HTTP 200 · 1304 bytes · sha256 730ccddf0996e451 · raw

final URL: https://www.independent.co.uk/robots.txt

1
2User-agent: *
3
4# Files
5Disallow: /CHANGELOG.txt
6Disallow: /cron.php
7Disallow: /_fs-ch-
8Disallow: /INSTALL.mysql.txt
9Disallow: /INSTALL.pgsql.txt
10Disallow: /INSTALL.sqlite.txt
11Disallow: /install.php
12Disallow: /INSTALL.txt
13Disallow: /LICENSE.txt
14Disallow: /MAINTAINERS.txt
15Disallow: /update.php
16Disallow: /UPGRADE.txt
17Disallow: /xmlrpc.php
18Disallow: /img/placeholder.gif
19# Paths (clean URLs)
20Disallow: /admin/
21Disallow: /comment/reply/
22Disallow: /filter/tips/
23Disallow: /node/add/
24Disallow: /search/
25Disallow: /user/register/
26Disallow: /user/password/
27Disallow: /user/login/
28Disallow: /user/logout/
29Disallow: /internal-api/
30# Paths (no clean URLs)
31Disallow: /?q=admin/
32Disallow: /?q=comment/reply/
33Disallow: /?q=filter/tips/
34Disallow: /?q=node/add/
35Disallow: /?q=search/
36Disallow: /?q=user/password/
37Disallow: /?q=user/register/
38Disallow: /?q=user/login/
39Disallow: /?q=user/logout/
40Disallow: /pugpig/
41Disallow: /videorequest
42Disallow: /api/
43Disallow: /api/html/
44Disallow: /api/user/
45Disallow: /71347885/
46Disallow: /cb
47
48# Ignore liveblog pagination and swipe tracking
49Disallow: *itm_channel=native
50Disallow: *?page=
51
52# Ignore refresh URLs
53Disallow: /*ILC-refresh
54
55User-agent: Nutch
56Disallow: /
57
58Sitemap: https://www.independent.co.uk/sitemaps/googlenews
59Sitemap: https://www.independent.co.uk/sitemap.xml
60