NeuralCrawl

πŸ‡ΊπŸ‡Έ ArtStation

artstation.com · Top 1000 websites · rank #609 · Art · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 927 bytes · sha256 24429f0a6f26 · raw

User-agent: *
Disallow: /*/likes
Disallow: /*/following
Disallow: /*/followers
Disallow: /*/collections
Disallow: /*/collections/likes
Disallow: /*/collections/*
Disallow: /registration/*
Disallow: /studentpro
Disallow: /2fa
Disallow: /users/password/edit

Sitemap: https://www.artstation.com/sitemap.xml
Sitemap: https://www.artstation.com/api/v2/marketplace/product-sitemap-index.xml
Sitemap: https://www.artstation.com/api/v2/prints/printed-product-sitemap-index.xml
Sitemap: https://www.artstation.com/api/v2/prints/curated-pages-sitemap-index.xml
Sitemap: https://www.artstation.com/api/v2/blogging/blog-sitemap-index.xml
Sitemap: https://www.artstation.com/api/v2/jobs/jobs-sitemap-index.xml
Sitemap: https://www.artstation.com/api/v2/learning/sitemap-index.xml
Sitemap: https://www.artstation.com/api/v2/seo/sitemap-seo-pages-index.xml
Sitemap: https://www.artstation.com/api/v2/competition/challenges-sitemap-index.xml

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived