NeuralCrawl

πŸ‡ΊπŸ‡Έ Wish

wish.com · E-commerce · rank #20 · E-commerce · live robots.txt ↗

AI crawler access (latest snapshot, 2h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 566 bytes · sha256 4c25050a8bc4 · raw

Sitemap: https://www.wish.com/sitemap.xml

User-agent: Googlebot
Disallow:

User-agent: Googlebot-image
Disallow:

User-agent: *
Disallow: /unsubscribe
Disallow: /gift-cards
Disallow: /transaction
Disallow: /logout
Disallow: /settings
Disallow: /opt-in-mobile
Disallow: /*?next
Disallow: /*&next
Disallow: /cart
Disallow: /shipping
Disallow: /payment
Disallow: /order
Disallow: /notifications
Disallow: /rewards
Disallow: /cash
Disallow: /wishlist
Disallow: /profile
Disallow: /daily-login-bonus
Disallow: /settings
Disallow: /ugc_share/
Disallow: /product-ratings/

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived