NeuralCrawl

State Farm Insurance / robots.txt snapshot

fetched 2026-06-20T01:10:30Z · HTTP 200 · 974 bytes · sha256 ae70df84b2baff63

final URL: https://www.statefarm.com/robots.txt

User-agent: PetalBot
Disallow: /

User-Agent: *
Allow: /.well-known/

# Disallow - documents that shouldn't be indexed. Including both English and Spanish, just in case pages link to docs in other languages.
Disallow: /content/dam/sf-library/en-us/secure/legacy/xlsx/*.xls
Disallow: /content/dam/sf-library/en-us/secure/legacy/pdf/*.pdf
Disallow: /content/dam/sf-library/en-us/secure/legacy/team-west/*.pdf
Disallow: /content/dam/sf-library/es-us/secure/legacy/xlsx/*.xls
Disallow: /content/dam/sf-library/es-us/secure/legacy/pdf/*.pdf
Disallow: /content/dam/sf-library/es-us/secure/legacy/team-west/*.pdf

# Disallow - old ones
Disallow: /errors/
Disallow: /_css/
Disallow: /_images/
Disallow: /_js/
Disallow: /cdn/
Disallow: /jscript/
Disallow: /pzn_json_inc/
Disallow: /status/
Disallow: /role/
Disallow: /general/
Disallow: /samples/
Disallow: /pdf/us/merchant-welcome-kit.pdf
Disallow: /discountdoublecheck/

# Sitemaps
Sitemap: https://www.statefarm.com/sitemap.xml
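
Several rules in the `User-Agent: *` group use `*` wildcards, so a plain prefix comparison would not evaluate them correctly. The following is a minimal Python sketch of how a crawler might check a path against a subset of these rules. It assumes RFC 9309-style matching (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the snapshot, and the `PetalBot` group is left out for brevity.

```python
import re

# Subset of rules copied from the "User-Agent: *" group in the snapshot.
# RULES is a hypothetical name used only for this illustration.
RULES = [
    ("allow", "/.well-known/"),
    ("disallow", "/content/dam/sf-library/en-us/secure/legacy/pdf/*.pdf"),
    ("disallow", "/errors/"),
    ("disallow", "/pdf/us/merchant-welcome-kit.pdf"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be fetched under `rules`.

    Assumed precedence (RFC 9309 style): the longest matching pattern
    wins, Allow beats Disallow on a tie, and no match means allowed.
    """
    best_len, best_allow = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, best_allow = len(pattern), allow
    return best_allow


if __name__ == "__main__":
    for p in ("/.well-known/security.txt", "/errors/404.html", "/insurance/auto"):
        print(p, is_allowed(p))
```

Real crawlers differ in how they handle wildcards and precedence, so this sketch should be checked against the behavior of the specific crawler in question before relying on it.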