NeuralCrawl

Boehringer Ingelheim / robots.txt snapshot

fetched 2026-06-20T01:10:30Z · HTTP 200 · 2900 bytes · sha256 f0eb85747c7eea34

final URL: https://www.boehringer-ingelheim.com/robots.txt
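
The Allow and Disallow patterns in this snapshot rely on `*` and `$` wildcards and on precedence between overlapping rules. As a rough illustration of how a crawler might evaluate them, the sketch below applies longest-match precedence in the spirit of RFC 9309, with Allow winning ties. The names `pattern_to_regex`, `is_allowed`, and `RULES`, as well as the sample paths, are hypothetical and chosen only for illustration. Python's standard `urllib.robotparser` is not used because it appears to match by plain prefix rather than interpreting these wildcards.

```python
import re

# A hand-picked subset of the "User-agent: *" rules from this snapshot.
RULES = [
    ("allow", "/core/*.css$"),
    ("allow", "/core/*.js?"),
    ("disallow", "/core/"),
    ("disallow", "/search/"),
    ("disallow", "/*/media/oembed"),
    ("disallow", "/sites/default/files/*.pdf"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end of
    the path. Everything else, including '?', is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under `rules`.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    Percent-encoding normalisation is deliberately ignored in this sketch.
    """
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_for_allow = len(pattern) == best_len and allow
            if longer or tie_for_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    # Hypothetical paths, not taken from the site.
    for sample in ("/core/themes/x/style.css", "/core/install.php", "/us/science"):
        print(sample, is_allowed(sample))
```

Under these assumptions, a stylesheet under `/core/` stays crawlable because the longer `Allow: /core/*.css$` pattern outranks `Disallow: /core/`, while other files under `/core/` remain blocked.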

#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used: http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/robotstxt.html

User-agent: *
# CSS, JS, Images
Allow: /core/*.css$
Allow: /core/*.css?
Allow: /core/*.js$
Allow: /core/*.js?
Allow: /core/*.gif
Allow: /core/*.jpg
Allow: /core/*.jpeg
Allow: /core/*.png
Allow: /core/*.svg
Allow: /profiles/*.css$
Allow: /profiles/*.css?
Allow: /profiles/*.js$
Allow: /profiles/*.js?
Allow: /profiles/*.gif
Allow: /profiles/*.jpg
Allow: /profiles/*.jpeg
Allow: /profiles/*.png
Allow: /profiles/*.svg

# Directories
Disallow: /core/
Disallow: /profiles/

# Files
Disallow: /README.md

# Paths (clean URLs)
Disallow: /admin/
Disallow: /comment/reply/
Disallow: /filter/tips
Disallow: /node/add/
Disallow: /search/
Disallow: /user/register
Disallow: /user/password
Disallow: /user/login
Disallow: /user/logout
Disallow: /media/oembed
Disallow: /*/media/oembed

# Paths (no clean URLs)
Disallow: /index.php/admin/
Disallow: /index.php/comment/reply/
Disallow: /index.php/filter/tips
Disallow: /index.php/node/add/
Disallow: /index.php/search/
Disallow: /index.php/user/password
Disallow: /index.php/user/register
Disallow: /index.php/user/login
Disallow: /index.php/user/logout
Disallow: /index.php/media/oembed
Disallow: /index.php/*/media/oembed

# Sitemaps not provided by this site but under the domain
Sitemap: https://www.boehringer-ingelheim.com/ca/fr/impact/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/ca/impact/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/ch/fr/centres-pneumopathie-interstitielle/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/ch/it/centri-polmonite-interstiziale/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/ch/lungenzentren/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/es/comprometidos-por-el-futuro/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/es/premio-periodistico/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/hu/ild-centrumok/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/kz/илф-центры/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/pt/podcasts/value-insider/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/se/prio/sitemap.xml
Sitemap: https://www.boehringer-ingelheim.com/us/science/boehringer-and-lilly-grants-alliance/sitemap.xml
Disallow: /disclaimer/market
Disallow: /disclaimer/bi
Disallow: /disclaimer/external
Disallow: /sites/default/files/*.pdf
Disallow: /libraries/pdf.js/web/
# Simple sitemap