BBC / robots.txt snapshot
← back to bbc.com · fetched 2026-06-20T01:10:30Z (17h ago) · HTTP 200 · 5656 bytes · sha256 7a21cb855b2e58ae · raw
final URL: https://www.bbc.com/robots.txt
| 1 | |
| 2 | # version: 36756f545af9144e59780f65727af4ba98093b3b |
| 3 | # The BBC's Terms of Use: https://www.bbc.co.uk/terms |
| 4 | # - Explain the rules for using our services |
| 5 | # - Tell you what you can do with our content |
| 6 | # |
| 7 | # In short: Please use our site like a human, not a robot. |
| 8 | # That means: |
| 9 | # - No scraping, crawling, or systematic extraction of content |
| 10 | # - No use of BBC content for training or fine-tuning AI models, including large language models (LLMs) |
| 11 | # - No retrieval-augmented generation (RAG), AI-powered search, agentic AI or grounding using BBC content |
| 12 | # - No creating datasets from BBC content |
| 13 | # - No text and data mining (TDM) under Article 4 of the EU Directive on Copyright in the Digital Single Market |
| 14 | # - No using BBC content to create summaries for your own use |
| 15 | # - No business use without permission (details: https://www.bbc.co.uk/usingthebbc/terms/can-i-use-bbc-content-for-my-business/) |
| 16 | # - The BBC reserves all rights in its content and expressly opts out of any statutory exceptions in any jurisdiction for text and data mining, as permitted by law |
| 17 | |
| 18 | # TL;DR: Browse, read, watch, enjoy - like a human. |
| 19 | # |
| 20 | |
| 21 | # HTTPS www.bbc.com |
| 22 | |
| 23 | User-agent: * |
| 24 | Sitemap: https://www.bbc.com/sitemaps/https-index-com-archive.xml |
| 25 | Sitemap: https://www.bbc.com/sitemaps/https-index-com-news.xml |
| 26 | Sitemap: https://www.bbc.com/sitemaps/https-index-com-archive_video.xml |
| 27 | Sitemap: https://www.bbc.com/sitemaps/https-index-com-video.xml |
| 28 | Sitemap: https://www.bbc.com/sitemaps/sitemap-com-ws-topics.xml |
| 29 | Sitemap: https://www.bbc.com/sport/sitemap.xml |
| 30 | Sitemap: https://www.bbc.com/sitemaps/sitemap-com-ws-topics.xml |
| 31 | Sitemap: https://www.bbc.com/afrique/sitemap.xml |
| 32 | Sitemap: https://www.bbc.com/arabic/sitemap.xml |
| 33 | Sitemap: https://www.bbc.com/bengali/sitemap.xml |
| 34 | Sitemap: https://www.bbc.com/burmese/sitemap.xml |
| 35 | Sitemap: https://www.bbc.com/gahuza/sitemap.xml |
| 36 | Sitemap: https://www.bbc.com/hausa/sitemap.xml |
| 37 | Sitemap: https://www.bbc.com/hindi/sitemap.xml |
| 38 | Sitemap: https://www.bbc.com/indonesia/sitemap.xml |
| 39 | Sitemap: https://www.bbc.com/mundo/sitemap.xml |
| 40 | Sitemap: https://www.bbc.com/pashto/sitemap.xml |
| 41 | Sitemap: https://www.bbc.com/persian/sitemap.xml |
| 42 | Sitemap: https://www.bbc.com/portuguese/sitemap.xml |
| 43 | Sitemap: https://www.bbc.com/russian/sitemap.xml |
| 44 | Sitemap: https://www.bbc.com/swahili/sitemap.xml |
| 45 | Sitemap: https://www.bbc.com/tajik/sitemap.xml |
| 46 | Sitemap: https://www.bbc.com/turkce/sitemap.xml |
| 47 | Sitemap: https://www.bbc.com/ukchina/simp/sitemap.xml |
| 48 | Sitemap: https://www.bbc.com/ukrainian/sitemap.xml |
| 49 | Sitemap: https://www.bbc.com/urdu/sitemap.xml |
| 50 | Sitemap: https://www.bbc.com/uzbek/sitemap.xml |
| 51 | Sitemap: https://www.bbc.com/vietnamese/sitemap.xml |
| 52 | Sitemap: https://www.bbc.com/zhongwen/simp/sitemap.xml |
| 53 | Sitemap: https://www.bbc.com/zhongwen/trad/sitemap.xml |
| 54 | Sitemap: https://www.bbc.com/bbcx/index_sitemap.xml |
| 55 | Sitemap: https://www.bbc.com/bbcx/audio_archive_sitemap.xml |
| 56 | Sitemap: https://www.bbc.com/bbcx/video_documentaries_sitemap.xml |
| 57 | Sitemap: https://www.bbc.com/bbcx/content_index_sitemap.xml |
| 58 | |
| 59 | Disallow: /asset/ |
| 60 | Disallow: /backstage/bbc-login-help/ |
| 61 | Disallow: /backstage/bbc-login-help$ |
| 62 | Disallow: /bitesize/search$ |
| 63 | Disallow: /bitesize/search/ |
| 64 | Disallow: /bitesize/search? |
| 65 | Disallow: /cbbc/search/ |
| 66 | Disallow: /cbbc/search$ |
| 67 | Disallow: /cbbc/search? |
| 68 | Disallow: /cbeebies/search/ |
| 69 | Disallow: /cbeebies/search$ |
| 70 | Disallow: /cbeebies/search? |
| 71 | Disallow: /chwilio/ |
| 72 | Disallow: /chwilio$ |
| 73 | Disallow: /chwilio? |
| 74 | Disallow: /education/blocks$ |
| 75 | Disallow: /education/blocks/ |
| 76 | Disallow: /newsround |
| 77 | Disallow: /search/ |
| 78 | Disallow: /search$ |
| 79 | Disallow: /search? |
| 80 | Disallow: /food/favourites |
| 81 | Disallow: /food/search*?* |
| 82 | Disallow: /food/recipes/search*?* |
| 83 | Disallow: /education/my$ |
| 84 | Disallow: /education/my/ |
| 85 | Disallow: /bitesize/my$ |
| 86 | Disallow: /bitesize/my/ |
| 87 | Disallow: /food/recipes/*/shopping-list |
| 88 | Disallow: /food/menus/*/shopping-list |
| 89 | Disallow: /news/0 |
| 90 | Disallow: /sport/alpha/ |
| 91 | Disallow: /ugc$ |
| 92 | Disallow: /ugc/ |
| 93 | Disallow: /ugcsupport$ |
| 94 | Disallow: /ugcsupport/ |
| 95 | Disallow: /userinfo/ |
| 96 | Disallow: /userinfo |
| 97 | Disallow: /u5llnop$ |
| 98 | Disallow: /u5llnop/ |
| 99 | Disallow: /sounds/search$ |
| 100 | Disallow: /sounds/search/ |
| 101 | Disallow: /sounds/search? |
| 102 | Disallow: /ws/includes |
| 103 | Disallow: /radio/imda |
| 104 | Disallow: /storyworks/preview/* |
| 105 | Disallow: /rd/search$ |
| 106 | Disallow: /rd/search/ |
| 107 | Disallow: /rd/search? |
| 108 | |
| 109 | User-agent: Amazonbot |
| 110 | Disallow: / |
| 111 | |
| 112 | User-agent: magpie-crawler |
| 113 | Disallow: / |
| 114 | |
| 115 | User-agent: CCBot |
| 116 | Disallow: / |
| 117 | |
| 118 | User-Agent: omgili |
| 119 | Disallow: / |
| 120 | |
| 121 | User-Agent: omgilibot |
| 122 | Disallow: / |
| 123 | |
| 124 | User-agent: Claude-Web |
| 125 | Disallow: / |
| 126 | |
| 127 | User-agent: ClaudeBot |
| 128 | Disallow: / |
| 129 | |
| 130 | User-agent: anthropic-ai |
| 131 | Disallow: / |
| 132 | |
| 133 | User-agent: cohere-ai |
| 134 | Disallow: / |
| 135 | |
| 136 | User-agent: Bytespider |
| 137 | Disallow: / |
| 138 | |
| 139 | User-agent: PetalBot |
| 140 | Disallow: / |
| 141 | |
| 142 | User-agent: Scrapy |
| 143 | Disallow: / |
| 144 | |
| 145 | User-agent: Applebot-Extended |
| 146 | Disallow: / |
| 147 | |
| 148 | User-agent: GPTBot |
| 149 | Disallow: / |
| 150 | |
| 151 | User-agent: ChatGPT-User |
| 152 | Disallow: / |
| 153 | |
| 154 | User-agent: Google-Extended |
| 155 | Disallow: / |
| 156 | |
| 157 | User-Agent: PerplexityBot |
| 158 | Disallow: / |
| 159 | |
| 160 | User-agent: Perplexity-User |
| 161 | Disallow: / |
| 162 | |
| 163 | User-agent: Google-CloudVertexBot |
| 164 | Disallow: / |
| 165 | |
| 166 | User-agent: meta-externalagent |
| 167 | Disallow: / |
| 168 | |
| 169 | User-agent: OAI-SearchBot |
| 170 | Disallow: / |
| 171 | |
| 172 | User-agent: YandexAdditional |
| 173 | Disallow: / |
| 174 | |
| 175 | User-agent: YandexAdditionalBot |
| 176 | Disallow: / |
| 177 | |
| 178 | User-agent: TurnitinBot |
| 179 | Disallow: / |
| 180 | |
| 181 | User-agent: Brightbot |
| 182 | Disallow: / |
| 183 | |
| 184 | User-agent: ApifyBot |
| 185 | Disallow: / |
| 186 | |
| 187 | User-agent: ApifyWebsiteContentCrawler |
| 188 | Disallow: / |
| 189 | |
| 190 | User-agent: Diffbot |
| 191 | Disallow: / |
| 192 | |
| 193 | User-agent: Diffbot-User |
| 194 | Disallow: / |
| 195 | |
| 196 | User-agent: ExaBot |
| 197 | Disallow: / |
| 198 | |
| 199 | User-agent: TavilyBot |
| 200 | Disallow: / |
| 201 | |
| 202 | User-agent: ShapBot |
| 203 | Disallow: / |
| 204 | |
| 205 | User-agent: YouBot |
| 206 | Disallow: / |
| 207 | |
| 208 | User-agent: FirecrawlAgent |
| 209 | Disallow: / |
| 210 | |
| 211 | User-agent: Amzn-SearchBot |
| 212 | Disallow: / |
| 213 | |
| 214 | User-agent: Amzn-User |
| 215 | Disallow: / |
| 216 | |
| 217 | User-agent: ProRataInc |
| 218 | Disallow: / |
| 219 | |
| 220 | User-agent: CloudflareBrowserRenderingCrawler |
| 221 | Disallow: / |
| 222 | |
| 223 | User-agent: AhrefsBot |
| 224 | Disallow: / |