NeuralCrawl

Weaviate / robots.txt snapshot

← back to weaviate.io · fetched 2026-06-20T01:10:30Z (15h ago) · HTTP 200 · 556 bytes · sha256 f0d7abe6b9f7e78f · raw

final URL: https://weaviate.io/robots.txt

1Sitemap: https://weaviate.io/sitemap-index.xml
2LLMS: https://weaviate.io/llms.txt
3
4User-agent: *
5Allow: /
6Allow: /llms.txt
7Disallow: /*?*
8Disallow: /expert-sessions
9Disallow: /blog/rss.xml
10Disallow: /blog/atom.xml
11Disallow: /feed
12Disallow: /feed.xml
13Disallow: /rss
14Disallow: /rss.xml
15Disallow: /atom
16Disallow: /atom.xml
17
18# AI Search Engine Bots
19User-agent: GPTBot
20Allow: /
21
22
23User-agent: ChatGPT-User
24Allow: /
25
26
27User-agent: PerplexityBot
28Allow: /
29
30
31User-agent: ClaudeBot
32Allow: /
33
34
35User-agent: anthropic-ai
36Allow: /
37
38
39User-agent: Applebot-Extended
40Allow: /