NeuralCrawl

πŸ‡ΊπŸ‡Έ Twitch

twitch.tv · Social Networks · rank #13 · Video · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 1096 bytes · sha256 e17c3908cc6c · raw

User-agent: Amazonbot
Disallow:

User-agent: Amzn-SearchBot
Disallow:

User-agent: Amzn-User
Disallow:

User-agent: *
Disallow: /login
Disallow: /signup
Disallow: /auth/
Disallow: /user/account-recovery
Disallow: /user/password-reset
Disallow: /user/not-me
Disallow: /activate
Disallow: /admin/*
Disallow: /wv/*
Disallow: /user/*
Disallow: /message/*
Disallow: /settings/
Disallow: /subscriptions
Disallow: /inventory
Disallow: /wallet
Disallow: /drops/
Disallow: /payments/
Disallow: /email-unsubscribe/
Disallow: /search
Disallow: /bits-checkout
Disallow: /redeem
Disallow: /claim
Disallow: /download-keys
Disallow: /popout/
Disallow: /embed/
Disallow: /chat/
Disallow: /moderator/
Disallow: /stream
Disallow: /collab
Disallow: /guest-star
Disallow: /sq
Disallow: /gl
Disallow: /sportradar-widgets
Disallow: /beta/
Disallow: /extensions/oauth-redirect
Disallow: /copyright-claims
Disallow: /report-appeal
Disallow: /oss-attribution
Disallow: /dj-signup
Disallow: /r/
Disallow: /stale/
Allow: /directory
Allow: /.well-known/assetlinks.json

Sitemap: https://www.twitch.tv/sitemapv2_index.xml.gz

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived