NeuralCrawl

Archive of Our Own / robots.txt snapshot

← back to archiveofourown.org · fetched 2026-06-20T01:10:30Z (18h ago) · HTTP 200 · 872 bytes · sha256 1493972bb388474f · raw

final URL: https://archiveofourown.org/robots.txt

1# See https://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
2
3User-agent: *
4Disallow: /works? # cruel but efficient
5Disallow: /autocomplete/
6Disallow: /downloads/
7Disallow: /external_works/
8# disallow indexing of search results
9Disallow: /bookmarks/search?
10Disallow: /people/search?
11Disallow: /tags/search?
12Disallow: /works/search?
13
14User-agent: Googlebot
15Disallow: /autocomplete/
16Disallow: /downloads/
17Disallow: /external_works/
18# Googlebot is smart and knows pattern matching
19Disallow: /works/*?
20Disallow: /*search?
21Disallow: /*?*query=
22Disallow: /*?*sort_
23Disallow: /*?*selected_tags
24Disallow: /*?*view_adult
25Disallow: /*?*tag_id
26Disallow: /*?*pseud_id
27Disallow: /*?*user_id
28Disallow: /*?*pseud=
29
30User-agent: CCBot
31Disallow: /
32
33User-agent: GPTBot
34Disallow: /
35
36User-agent: ChatGPT-User
37Disallow: /
38
39User-agent: Slurp
40Crawl-delay: 30