NeuralCrawl

Novartis / robots.txt snapshot

← back to novartis.com · fetched 2026-06-20T01:10:30Z (18h ago) · HTTP 200 · 1687 bytes · sha256 1f032fd298a968cb · raw

final URL: https://www.novartis.com/robots.txt

1#
2# robots.txt
3#
4# This file is to prevent the crawling and indexing of certain parts
5# of your site by web crawlers and spiders run by sites like Yahoo!
6# and Google. By telling these "robots" where not to go on your site,
7# you save bandwidth and server resources.
8#
9# This file will be ignored unless it is at the root of your host:
10# Used: http://example.com/robots.txt
11# Ignored: http://example.com/site/robots.txt
12#
13# For more information about the robots.txt standard, see:
14# http://www.robotstxt.org/robotstxt.html
15
16User-agent: *
17# CSS, JS, Images
18Allow: /core/*.css$
19Allow: /core/*.css?
20Allow: /core/*.js$
21Allow: /core/*.js?
22Allow: /core/*.gif
23Allow: /core/*.jpg
24Allow: /core/*.jpeg
25Allow: /core/*.png
26Allow: /core/*.svg
27Allow: /profiles/*.css$
28Allow: /profiles/*.css?
29Allow: /profiles/*.js$
30Allow: /profiles/*.js?
31Allow: /profiles/*.gif
32Allow: /profiles/*.jpg
33Allow: /profiles/*.jpeg
34Allow: /profiles/*.png
35Allow: /profiles/*.svg
36# Directories
37Disallow: /core/
38Disallow: /profiles/
39# Files
40
41Disallow: /web.config
42# Paths (clean URLs)
43
44Disallow: /comment/reply/
45Disallow: /filter/tips
46
47Disallow: /search/
48
49
50
51
52# Paths (no clean URLs)
53
54Disallow: /index.php/comment/reply/
55Disallow: /index.php/filter/tips
56
57Disallow: /index.php/search/
58
59
60
61
62Disallow: /*printable
63Disallow: /de-de/sites/novartis_de/files/*FAQ*
64Disallow: /de-de/sites/novartis_de/files/*RMP*
65Disallow: /de-de/sites/novartis_de/files/*FAQ*
66Disallow: /de-de/sites/novartis_de/files/*RMP*
67Disallow: /*/spc/*FAQ*
68Disallow: /*/spc/*RMP*
69Disallow: /*FAQ*
70Disallow: /*RMP*
71
72User-agent: GPTBot
73Crawl-delay: 2
74
75User-agent: bingbot
76Crawl-delay: 2
77
78User-agent: GoogleOther
79Crawl-delay: 2
80Disallow: /export/site-selector.json