NeuralCrawl

Ars Technica / robots.txt snapshot

← back to arstechnica.com · fetched 2026-06-20T01:10:30Z (18h ago) · HTTP 200 · 1912 bytes · sha256 0abd612f8a892958 · raw

final URL: https://arstechnica.com/robots.txt

1Sitemap: https://arstechnica.com/sitemap.xml
2
3# Google Image
4User-agent: Googlebot-Image
5Disallow:
6Allow: /*
7
8# Google AdSense
9User-agent: Mediapartners-Google*
10Disallow:
11
12User-agent: Google-Extended
13User-agent: Google-CloudVertexBot
14User-agent: GoogleOther
15User-agent: Applebot-Extended
16User-agent: meta-webindexer
17User-agent: meta-externalagent
18User-agent: meta-externalfetcher
19User-agent: ClaudeBot
20User-agent: Claude-SearchBot
21User-agent: Claude-User
22User-agent: PerplexityBot
23User-agent: Perplexity-User
24User-agent: cohere-training-data-crawler
25User-agent: cohere-ai
26User-agent: CCBot
27User-agent: PanguBot
28User-agent: PetalBot
29User-agent: Bytespider
30User-agent: Diffbot
31User-agent: DuckAssistBot
32User-agent: MistralAI-User
33User-agent: Timpibot
34User-agent: Webzio
35User-agent: Webzio-Extended
36User-agent: archive.org_bot
37User-agent: ia_archiver
38User-agent: ia_archiver-web.archive.org
39User-agent: heritrix
40User-agent: anthropic-ai
41User-agent: Claude-Web
42User-agent: FacebookBot
43User-agent: Omgilibot
44User-agent: YouBot
45Disallow: /
46
47User-agent: Amazonbot
48Disallow: /
49Allow: /feed/amazon-rss
50
51# Global
52User-agent: *
53Disallow: /cgi-bin/
54Disallow: /wp/wp-admin/
55Disallow: /wp/wp-includes/
56Disallow: /wp/wp-content/
57Disallow: /wp-content/plugins/
58Disallow: /wp-content/mu_plugins/
59Disallow: /wp-content/cache/
60Disallow: /wp-content/themes/
61Disallow: /trackback/
62Disallow: /comments/
63Disallow: /category/*/*
64Disallow: */trackback/
65Disallow: */comments/
66Disallow: /search
67Disallow: */*comments=
68Disallow: */*comments-page=
69Disallow: /services/*
70Disallow: /com.condenast/yv8*
71
72# Xenforo
73Disallow: /civis/account/
74Disallow: /civis/members/
75Disallow: /civis/attachments/
76Disallow: /civis/goto/
77Disallow: /civis/help/
78Disallow: /civis/posts/
79Disallow: /civis/login/
80Disallow: /civis/search/
81Disallow: /civis/admin.php
82Disallow: /civis/threads/*in_iframe=1*
83Disallow: /civis/threads/*wp_data=*
84Disallow: */toggle-*.json?