NeuralCrawl

Heineken / robots.txt snapshot

← back to theheinekencompany.com · fetched 2026-06-20T01:10:30Z (15h ago) · HTTP 200 · 1099 bytes · sha256 12e23870b224153a · raw

final URL: https://www.theheinekencompany.com/robots.txt

1User-agent: *
2# CSS, JS, Images
3Allow: /core/*.css$
4Allow: /core/*.css?
5Allow: /core/*.js$
6Allow: /core/*.js?
7Allow: /core/*.gif
8Allow: /core/*.jpg
9Allow: /core/*.jpeg
10Allow: /core/*.png
11Allow: /core/*.svg
12Allow: /profiles/*.css$
13Allow: /profiles/*.css?
14Allow: /profiles/*.js$
15Allow: /profiles/*.js?
16Allow: /profiles/*.gif
17Allow: /profiles/*.jpg
18Allow: /profiles/*.jpeg
19Allow: /profiles/*.png
20Allow: /profiles/*.svg
21# Directories
22Disallow: /core/
23Disallow: /profiles/
24# Files
25Disallow: /README.txt
26Disallow: /web.config
27# Paths (clean URLs)
28Disallow: /admin/
29Disallow: /comment/reply/
30Disallow: /filter/tips
31Disallow: /node/add/
32Disallow: /search/
33Disallow: /user/*
34# Paths (no clean URLs)
35Disallow: /index.php/admin/
36Disallow: /index.php/comment/reply/
37Disallow: /index.php/filter/tips
38Disallow: /index.php/node/add/
39Disallow: /index.php/search/
40Disallow: /index.php/user/*
41
42#Sitemap sitemap.xml will be auto added at the end of this document.
43Disallow: /node/*
44Disallow: /taxonomy/*
45# XML sitemap. This string generated by id_custom module.
46Sitemap: https://www.theheinekencompany.com/sitemap.xml