NeuralCrawl

Builders FirstSource / robots.txt snapshot

← back to bldr.com · fetched 2026-06-20T01:10:30Z (18h ago) · HTTP 200 · 2447 bytes · sha256 6e55805052ae324e · raw

final URL: https://www.bldr.com/robots.txt

1
2
3# For all robots
4User-agent: *
5
6# Block access to specific groups of pages
7Disallow: /cart
8Disallow: /checkout
9Disallow: /my-account
10Disallow: /request-quote-success
11
12# Allow search crawlers to discover the sitemap
13Sitemap: https://www.bldr.com/sitemap.xml
14
15# Block CazoodleBot as it does not present correct accept content headers
16User-agent: CazoodleBot
17Disallow: /
18
19# Block MJ12bot as it is just noise
20User-agent: MJ12bot
21Disallow: /
22
23# Block dotbot as it cannot parse base urls properly
24User-agent: dotbot/1.0
25Disallow: /
26
27# Block Gigabot
28User-agent: Gigabot
29Disallow: /
30
31
32
33# BLDRFST-1404 - Added following explicit allows, although not really necessary since we have universal allows at the top
34# ========================================
35# AI SEARCH & CITATION BOTS - ALLOW THESE
36# ========================================
37# These bots fetch content in real-time when users ask questions
38# Allowing these ensures your content appears in AI answers
39
40# OpenAI - ChatGPT Search & Citations
41User-agent: OAI-SearchBot
42Allow: /
43
44User-agent: ChatGPT-User
45Allow: /
46
47# Anthropic - Claude Citations & Search
48User-agent: ClaudeBot
49Allow: /
50
51User-agent: Claude-Web
52Allow: /
53
54User-agent: Claude-User
55Allow: /
56
57User-agent: Claude-SearchBot
58Allow: /
59
60# Google - Gemini Search (for AI Overviews)
61User-agent: Googlebot
62Allow: /
63
64# Perplexity - AI Search
65User-agent: PerplexityBot
66Allow: /
67
68User-agent: Perplexity-User
69Allow: /
70
71# ========================================
72# AI TRAINING BOTS - YOUR CHOICE
73# ========================================
74# These collect data to train models
75# You can allow or block these without affecting citations
76
77# OpenAI Training (optional - allow if you want content in future models)
78User-agent: GPTBot
79Allow: /
80
81# Anthropic Training (optional)
82User-agent: anthropic-ai
83Allow: /
84
85# Google AI Training (optional)
86User-agent: Google-Extended
87Allow: /
88
89# Common Crawl (used by many AI companies)
90User-agent: CCBot
91Allow: /
92
93# Apple Intelligence
94User-agent: Applebot-Extended
95Allow: /
96
97# Meta/Facebook AI
98User-agent: Meta-ExternalAgent
99Allow: /
100
101User-agent: Meta-ExternalFetcher
102Allow: /
103
104# Amazon AI
105User-agent: Amazonbot
106Allow: /
107
108# ByteDance/TikTok
109User-agent: Bytespider
110Allow: /
111
112# You.com
113User-agent: YouBot
114Allow: /
115
116# ========================================
117# PROTECTED AREAS (if needed)
118# ========================================
119# Example: Block all bots from private areas
120# User-agent: *
121# Disallow: /general-test