TotalEnergies / robots.txt snapshot
fetched 2026-06-20T01:10:30Z · HTTP 200 · 4248 bytes · sha256 3e614c1c032f938d
final URL: https://totalenergies.com/robots.txt
#
# robots.txt
#
# This file is to prevent the crawling and indexing of certain parts
# of your site by web crawlers and spiders run by sites like Yahoo!
# and Google. By telling these "robots" where not to go on your site,
# you save bandwidth and server resources.
#
# This file will be ignored unless it is at the root of your host:
# Used: http://example.com/robots.txt
# Ignored: http://example.com/site/robots.txt
#
# For more information about the robots.txt standard, see:
# http://www.robotstxt.org/robotstxt.html

User-agent: *
# CSS, JS, Images
Allow: /core/*.css$
Allow: /core/*.css?
Allow: /core/*.js$
Allow: /core/*.js?
Allow: /core/*.gif
Allow: /core/*.jpg
Allow: /core/*.jpeg
Allow: /core/*.png
Allow: /core/*.svg
Allow: /profiles/*.css$
Allow: /profiles/*.css?
Allow: /profiles/*.js$
Allow: /profiles/*.js?
Allow: /profiles/*.gif
Allow: /profiles/*.jpg
Allow: /profiles/*.jpeg
Allow: /profiles/*.png
Allow: /profiles/*.svg
# Directories
Disallow: /core/
Disallow: /profiles/
# Files
Disallow: /README.txt
Disallow: /web.config
# Paths (clean URLs)
Disallow: /comment/reply/
Disallow: /filter/tips
Disallow: /node/add/
Allow: /search/
Allow: /recherche/
# Paths (no clean URLs)
Disallow: /index.php/comment/reply/
Disallow: /index.php/filter/tips
Disallow: /index.php/node/add/
Disallow: /index.php/search/
Disallow: /index.php/user/logout/
# Search (with parameters)
Allow: /search/content?*
Allow: /recherche/contenu?*
Allow: */medias/actualites*page=*
Allow: */media/news*page=*
Allow: */medias/medias*page=*
Allow: */media/media*page=*
Disallow: */formulaire-de-contact/*
Disallow: */contact-form/*
Disallow: /search-content*
Disallow: */recherche-contenu*
# Integration (static files)
Disallow: /themes/custom/*/integration/
Disallow: */contenu/publications/*

# -------------------------
# Newsroom
# -------------------------
# 1) Newsroom home (FR/EN) – allow only lang=fra or lang=eng
Allow: */newsroom/fr/?lang=fra
Allow: */newsroom/en/?lang=eng
Disallow: */newsroom/fr/?
Disallow: */newsroom/en/?

# 2) Communiqués de presse – allow only all-themes (+ pagination)
Allow: */newsroom/section/communiques-de-presse/?lang=fra&topic=all-themes$
Allow: */newsroom/section/communiques-de-presse/?lang=eng&topic=all-themes$
Allow: */newsroom/section/communiques-de-presse/?lang=fra&topic=all-themes&pages=
Allow: */newsroom/section/communiques-de-presse/?lang=eng&topic=all-themes&pages=
Disallow: */newsroom/section/communiques-de-presse/?

# 3) Dossiers de presse – allow only lang param (FR/EN)
Allow: */newsroom/section/dossiers-de-presse/?lang=fra
Allow: */newsroom/section/dossiers-de-presse/?lang=eng
Disallow: */newsroom/section/dossiers-de-presse/?

# 4) Revues de presse – allow only vu-dans-la-presse (+ pagination) FR/EN
Allow: */newsroom/section/revues-de-presse/?lang=fra&cat=vu-dans-la-presse$
Allow: */newsroom/section/revues-de-presse/?lang=eng&cat=vu-dans-la-presse$
Allow: */newsroom/section/revues-de-presse/?lang=fra&cat=vu-dans-la-presse&pages=
Allow: */newsroom/section/revues-de-presse/?lang=eng&cat=vu-dans-la-presse&pages=
Disallow: */newsroom/section/revues-de-presse/?

# 5) Contacts – allow only lang param (FR/EN)
Allow: */newsroom/section/contacts/?lang=fra
Allow: */newsroom/section/contacts/?lang=eng
Disallow: */newsroom/section/contacts/?

# 6) Search – disallow all
Disallow: */newsroom/?s
Disallow: */newsroom/?lang=eng&s
Disallow: */newsroom/?lang=fra&s

# 7) Statements – allow lang only & all-themes (+ pagination)
Allow: */newsroom/section/statements/?lang=fra$
Allow: */newsroom/section/statements/?lang=eng$
Allow: */newsroom/section/statements/?lang=fra&topic=all-themes$
Allow: */newsroom/section/statements/?lang=eng&topic=all-themes$
Allow: */newsroom/section/statements/?lang=fra&pages=
Allow: */newsroom/section/statements/?lang=eng&pages=
Allow: */newsroom/section/statements/?lang=fra&topic=all-themes&pages=
Allow: */newsroom/section/statements/?lang=eng&topic=all-themes&pages=
Disallow: */newsroom/section/statements/?

# 8) Selection – disallow all
Disallow: */newsroom/fr/selection/
Disallow: */newsroom/en/selection/
Sitemap: https://totalenergies.com/sitemap.xml
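
The Allow/Disallow pairs in this file only work as the comments describe if the crawler applies longest-match precedence with `*` and `$` wildcards, as described in RFC 9309. The sketch below is a minimal, unofficial illustration of that evaluation, not the logic of any particular crawler. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, the rule list is a small subset copied from the snapshot, and the example request paths are invented for illustration.

```python
import re

# A small subset of the "User-agent: *" rules from the snapshot.
RULES = [
    ("allow", "/core/*.css$"),
    ("disallow", "/core/"),
    ("allow", "*/newsroom/fr/?lang=fra"),
    ("disallow", "*/newsroom/fr/?"),
    ("disallow", "*/newsroom/?s"),
    ("allow", "/search/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters, a trailing '$' anchors the end
    of the URL, and every other character (including '?') is literal.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be crawled.

    Assumption: the longest matching pattern wins, and an Allow rule
    wins a tie against a Disallow rule of the same length.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix rule.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    # Hypothetical request paths, chosen to exercise each rule pair.
    for path in [
        "/core/misc/drupal.css",
        "/core/install.php",
        "/newsroom/fr/?lang=fra",
        "/newsroom/fr/?utm=x",
        "/newsroom/?s=gas",
        "/about",
    ]:
        print(path, "->", "allowed" if is_allowed(path) else "blocked")
```

Under these assumptions, the more specific Allow line (for example `*/newsroom/fr/?lang=fra`) overrides the shorter Disallow line (`*/newsroom/fr/?`) regardless of the order in which they appear in the file. Crawlers that use first-match or plain prefix matching would interpret the same file differently.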