SoundCloud / robots.txt snapshot
← back to soundcloud.com · fetched 2026-06-25T22:38:12Z (7h ago) · HTTP 200 · 1441 bytes · sha256 7bbee7524de61fe4 · raw
final URL: https://soundcloud.com/robots.txt
| 1 | # ============================================================================= |
| 2 | # robots.txt for soundcloud.com |
| 3 | # Updated: 2026-05-05 |
| 4 | # ============================================================================= |
| 5 | |
| 6 | # AI Crawlers: editorial only, no UGC training |
| 7 | User-Agent: anthropic-ai |
| 8 | User-Agent: ClaudeBot |
| 9 | User-Agent: Claude-Web |
| 10 | User-Agent: GPTBot |
| 11 | User-Agent: ChatGPT-User |
| 12 | User-Agent: OAI-SearchBot |
| 13 | User-Agent: CCBot |
| 14 | User-Agent: PerplexityBot |
| 15 | User-Agent: Google-Extended |
| 16 | User-Agent: Applebot-Extended |
| 17 | User-Agent: Bytespider |
| 18 | User-Agent: Amazonbot |
| 19 | User-Agent: Meta-ExternalAgent |
| 20 | User-Agent: cohere-ai |
| 21 | |
| 22 | # Homepage |
| 23 | Allow: /$ |
| 24 | |
| 25 | # Platform: Legal & Policy |
| 26 | Allow: /terms-of-use |
| 27 | Allow: /community-guidelines |
| 28 | Allow: /transparency-reports |
| 29 | Allow: /accessibility-statement |
| 30 | Allow: /imprint |
| 31 | |
| 32 | # Platform: Discovery & Editorial |
| 33 | Allow: /discover |
| 34 | Allow: /stories |
| 35 | Allow: /topic |
| 36 | |
| 37 | # Platform: Product & Marketing |
| 38 | Allow: /pro |
| 39 | Allow: /download |
| 40 | Allow: /jobs |
| 41 | Allow: /go |
| 42 | Allow: /getstarted |
| 43 | |
| 44 | # Platform: Corporate |
| 45 | Allow: /company |
| 46 | |
| 47 | # Platform: Technical |
| 48 | Allow: /sitemap |
| 49 | Allow: /sitemapIndex |
| 50 | |
| 51 | # Block everything else (catches all UGC at root paths) |
| 52 | Disallow: / |
| 53 | |
| 54 | # Search engines and all other crawlers: index UGC, block low-value paths |
| 55 | User-Agent: * |
| 56 | Disallow: /search |
| 57 | Disallow: /you/ |
| 58 | Disallow: /stream |
| 59 | Disallow: /upload |
| 60 | Disallow: /settings |
| 61 | Disallow: /messages |
| 62 | Disallow: /*? |
| 63 | |
| 64 | Sitemap: https://soundcloud.com/sitemap.xml |
| 65 | Sitemap: https://soundcloud.com/sitemapIndex.xml |