NeuralCrawl

Societe Generale / robots.txt snapshot

← back to societegenerale.com · fetched 2026-06-20T01:10:30Z (15h ago) · HTTP 200 · 1654 bytes · sha256 399e4ae3a8fc957c · raw

final URL: https://www.societegenerale.com/robots.txt

1#
2# robots.txt
3#
4# This file is to prevent the crawling and indexing of certain parts
5# of your site by web crawlers and spiders run by sites like Yahoo!
6# and Google. By telling these "robots" where not to go on your site,
7# you save bandwidth and server resources.
8#
9# This file will be ignored unless it is at the root of your host:
10# Used: http://example.com/robots.txt
11# Ignored: http://example.com/site/robots.txt
12#
13# For more information about the robots.txt standard, see:
14# http://www.robotstxt.org/robotstxt.html
15
16Sitemap: https://www.societegenerale.com/sg_search/sitemap
17
18User-agent: *
19# CSS, JS, Images
20Allow: /core/*.css$
21Allow: /core/*.css?
22Allow: /core/*.js$
23Allow: /core/*.js?
24Allow: /core/*.gif
25Allow: /core/*.jpg
26Allow: /core/*.jpeg
27Allow: /core/*.png
28Allow: /core/*.svg
29Allow: /profiles/*.css$
30Allow: /profiles/*.css?
31Allow: /profiles/*.js$
32Allow: /profiles/*.js?
33Allow: /profiles/*.gif
34Allow: /profiles/*.jpg
35Allow: /profiles/*.jpeg
36Allow: /profiles/*.png
37Allow: /profiles/*.svg
38# Directories
39Disallow: /core/
40Disallow: /profiles/
41# Files
42Disallow: /README.txt
43Disallow: /web.config
44# Paths (clean URLs)
45Disallow: /admin/
46Disallow: /comment/reply/
47Disallow: /filter/tips
48Disallow: /node/add/
49Disallow: /search/
50Disallow: /user/register/
51Disallow: /user/password/
52Disallow: /user/login/
53Disallow: /user/logout/
54# Paths (no clean URLs)
55Disallow: /index.php/admin/
56Disallow: /index.php/comment/reply/
57Disallow: /index.php/filter/tips
58Disallow: /index.php/node/add/
59Disallow: /index.php/search/
60Disallow: /index.php/user/password/
61Disallow: /index.php/user/register/
62Disallow: /index.php/user/login/
63Disallow: /index.php/user/logout/