NeuralCrawl

Halliburton / robots.txt snapshot

fetched 2026-06-20T01:10:30Z (18h ago) · HTTP 200 · 562 bytes · sha256 e0e9f225060cb5ff

final URL: https://www.halliburton.com/robots.txt

User-agent: *
Allow: /

Sitemap: https://www.halliburton.com/sitemap.xml


# -----------------------------
# AI data extraction / brokers
# -----------------------------

# Diffbot (AI data extraction & resale)
User-agent: Diffbot
Disallow: /

User-agent: Diffbot-User
Disallow: /

# Webz.io / Omgili (data broker, AI training feeds)
User-agent: webzio-extended
Disallow: /

User-agent: webzio
Disallow: /

User-agent: omgili
Disallow: /

User-agent: omgilibot
Disallow: /

# MyCentralAIScraperBot (Unknown scraper)
User-agent: MyCentralAIScraperBot
Disallow: /
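
A minimal sketch of how these rules might be evaluated, assuming Python's standard `urllib.robotparser`. The helper name `is_allowed` and the sample agent `Googlebot` are illustrative and do not come from the snapshot. Real crawlers may match user-agent tokens differently from this parser, so treat the result as an approximation rather than a statement of how any listed bot behaves.

```python
from urllib.robotparser import RobotFileParser

# Rules copied from the snapshot above (comments omitted).
ROBOTS_TXT = """\
User-agent: *
Allow: /

Sitemap: https://www.halliburton.com/sitemap.xml

User-agent: Diffbot
Disallow: /

User-agent: Diffbot-User
Disallow: /

User-agent: webzio-extended
Disallow: /

User-agent: webzio
Disallow: /

User-agent: omgili
Disallow: /

User-agent: omgilibot
Disallow: /

User-agent: MyCentralAIScraperBot
Disallow: /
"""

# Parse the text locally instead of fetching it over the network.
_parser = RobotFileParser()
_parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str = "/") -> bool:
    """Hypothetical helper: True if `agent` may fetch `path` under these rules."""
    return _parser.can_fetch(agent, "https://www.halliburton.com" + path)


if __name__ == "__main__":
    # "Googlebot" is an assumed example of an agent with no dedicated group.
    for agent in ("Googlebot", "Diffbot", "webzio", "omgilibot"):
        print(agent, "allowed" if is_allowed(agent) else "blocked")
```

Agents without a dedicated group fall through to the `User-agent: *` group, which allows everything. Each named group disallows the whole site for that agent.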