NeuralCrawl

The New Yorker / robots.txt snapshot

← back to newyorker.com · fetched 2026-06-20T01:10:30Z (18h ago) · HTTP 200 · 1896 bytes · sha256 a0af8d88a8255293 · raw

final URL: https://www.newyorker.com/robots.txt

1Sitemap: https://www.newyorker.com/sitemap.xml
2Sitemap: https://www.newyorker.com/tagpages-sitemap.xml
3Sitemap: https://www.newyorker.com/categories-sitemap.xml
4Sitemap: https://www.newyorker.com/contributors-sitemap.xml
5Sitemap: https://www.newyorker.com/bundles-sitemap.xml
6Sitemap: https://www.newyorker.com/feed/google-news-sitemap-feed/sitemap-google-news
7Sitemap: https://www.newyorker.com/feed/rss
8
9User-agent: *
10Disallow: /*?
11Allow: /*?page
12Allow: /*.xml?
13Allow: /*.js
14Allow: /*.css
15Allow: /*.ico
16Allow: /verso/static
17Allow: /*.svg
18Allow: /*.png
19Allow: /*.jpg
20Allow: /*.jpeg
21Allow: /*rss?
22Allow: /*?id=
23Disallow: /auth/
24Disallow: /account/
25Disallow: /user/
26Disallow: /user-context
27Disallow: /preview/
28Disallow: /search
29Disallow: /product/
30Disallow: /cdn-cgi/
31Disallow: /services.min.js
32Disallow: /com.condenast/yv8
33
34###
35Allow: /v2/offers/tnya
36Disallow: /cartoon/
37Disallow: /newsletters/
38###
39
40User-agent: FacebookExternalHit
41User-agent: Twitterbot
42User-agent: Pinterestbot
43User-agent: LinkedInBot
44User-agent: TikTokSpider
45User-agent: Storebot-Google
46User-agent: Google-InspectionTool
47User-agent: AmazonAdBot
48Allow: /
49
50User-agent: Google-Extended
51User-agent: Google-CloudVertexBot
52User-agent: GoogleOther
53User-agent: Applebot-Extended
54User-agent: Amazonbot
55User-agent: meta-webindexer
56User-agent: meta-externalagent
57User-agent: meta-externalfetcher
58User-agent: ClaudeBot
59User-agent: Claude-SearchBot
60User-agent: Claude-User
61User-agent: PerplexityBot
62User-agent: Perplexity-User
63User-agent: cohere-training-data-crawler
64User-agent: cohere-ai
65User-agent: CCBot
66User-agent: PanguBot
67User-agent: PetalBot
68User-agent: Bytespider
69User-agent: Diffbot
70User-agent: DuckAssistBot
71User-agent: MistralAI-User
72User-agent: Timpibot
73User-agent: Webzio
74User-agent: Webzio-Extended
75User-agent: archive.org_bot
76User-agent: ia_archiver
77User-agent: ia_archiver-web.archive.org
78User-agent: heritrix
79Disallow: /