SEO & AI search
The archive — every monitored site, its robots.txt history and AI-crawler stance.
| # | Country | Company | Domain | Sector | robots.txt | AI bots blocked | Snapshots | Changes | Last change |
|---|---|---|---|---|---|---|---|---|---|
| 1 | 🇺🇸 | Semrush | | SEO software | present | 0 | 1 | 0 | — |
| 2 | 🇸🇬 | Ahrefs | | SEO software | present | 0 | 1 | 0 | — |
| 3 | 🇺🇸 | Similarweb | | Web intelligence | present | 0 | 1 | 0 | — |
| 4 | 🇺🇸 | Moz | | SEO software | present | 0 | 1 | 0 | — |
| 5 | 🇺🇸 | Profound | | AI search analytics | present | 0 | 1 | 0 | — |
| 6 | 🇺🇸 | NeuralCrawl | | Neural Crawler | present | 0 | 1 | 0 | — |
| 7 | 🇺🇸 | BrightEdge | | SEO software | present | 0 | 1 | 0 | — |
| 8 | 🇺🇸 | seoClarity | | SEO software | present | 0 | 1 | 0 | — |
| 9 | 🇫🇷 | Botify | | SEO software | present | 0 | 1 | 0 | — |
| 10 | 🇺🇸 | Conductor | | SEO software | present | 0 | 1 | 0 | — |
| 11 | 🇬🇧 | Lumar | | SEO crawler | present | 0 | 1 | 0 | — |
| 12 | 🇵🇱 | Surfer | | SEO software | present | 0 | 1 | 0 | — |
| 13 | 🇺🇸 | Clearscope | | SEO software | present | 0 | 1 | 0 | — |
| 14 | 🇬🇧 | Screaming Frog | | SEO crawler | present | 0 | 1 | 0 | — |
| 15 | 🇬🇧 | Sitebulb | | SEO crawler | present | 0 | 1 | 0 | — |
| 16 | 🇫🇷 | Oncrawl | | SEO crawler | present | 0 | 1 | 0 | — |
| 17 | 🇬🇧 | Majestic | | SEO intelligence | present | 0 | 1 | 0 | — |
| 18 | 🇺🇸 | SerpApi | | Search API | present | 0 | 1 | 0 | — |
| 19 | 🇺🇸 | Schema.org | | Structured data standard | present | 0 | 1 | 0 | — |
| 20 | 🇺🇸 | Algolia | | Site search | present | 0 | 1 | 0 | — |
| 21 | 🇨🇦 | Coveo | | Enterprise search | unreachable | 0 | 0 | 0 | — |
| 22 | 🇺🇸 | Elastic | | Enterprise search | present | 0 | 1 | 0 | — |
| 23 | 🇺🇸 | Common Crawl | | Web corpus | present | 0 | 1 | 0 | — |
| 24 | 🇺🇸 | Diffbot | | Structured web data | present | 0 | 1 | 0 | — |
| 25 | 🇮🇱 | Webz.io | | Web data feeds | present | 0 | 1 | 0 | — |
| 26 | 🇺🇸 | | | Search engine | present | 0 | 1 | 0 | — |
| 27 | 🇺🇸 | Alphabet | | Search engine | unreachable | 0 | 0 | 0 | — |
| 28 | 🇺🇸 | Microsoft Bing | | Search engine | present | 0 | 1 | 0 | — |
| 29 | 🇺🇸 | Brave Search | | Search engine | present | 0 | 1 | 0 | — |
| 30 | 🇺🇸 | DuckDuckGo | | Search engine | present | 0 | 1 | 0 | — |
| 31 | 🇺🇸 | Yahoo Search | | Search engine | present | 🤖 14 | 1 | 0 | — |
| 32 | 🇨🇳 | Baidu | | Technology | present | 0 | 1 | 0 | — |
| 33 | 🇷🇺 | Yandex | | Search engine | present | 0 | 1 | 0 | — |
| 34 | 🇰🇷 | Naver | | Search engine | present | 0 | 1 | 0 | — |
| 35 | 🇺🇸 | Kagi | | Search engine | present | 0 | 1 | 0 | — |
| 36 | 🇩🇪 | Ecosia | | Search engine | present | 0 | 1 | 0 | — |
| 37 | 🇫🇷 | Qwant | | Search engine | present | 0 | 1 | 0 | — |
| 38 | 🇳🇱 | Startpage | | Search engine | present | 0 | 1 | 0 | — |
| 39 | 🇬🇧 | Mojeek | | Search engine | present | 0 | 1 | 0 | — |
| 40 | 🇺🇸 | Perplexity | | LLM | present | 0 | 1 | 0 | — |
| 41 | 🇺🇸 | OpenAI | | LLM | present | 0 | 1 | 0 | — |
| 42 | 🇺🇸 | Anthropic | | LLM | present | 0 | 1 | 0 | — |
| 43 | 🇺🇸 | You.com | | AI answer engine | present | 0 | 1 | 0 | — |
| 44 | 🇺🇸 | Phind | | AI answer engine | blocked | 0 | 0 | 0 | — |
| 45 | 🇺🇸 | Exa | | AI search API | present | 0 | 1 | 0 | — |
| 46 | 🇺🇸 | Tavily | | AI search API | present | 0 | 1 | 0 | — |
| 47 | 🇺🇸 | Andi Search | | AI answer engine | present | 0 | 1 | 0 | — |
| 48 | 🇯🇵 | Felo | | AI answer engine | present | 0 | 1 | 0 | — |
| 49 | 🇺🇸 | Komo | | AI answer engine | present | 0 | 1 | 0 | — |
| 50 | 🇺🇸 | Waldo | | AI research search | absent | 0 | 0 | 0 | — |
| 51 | 🇺🇸 | Consensus | | AI research search | present | 0 | 1 | 0 | — |
| 52 | 🇺🇸 | Arc Search | | AI browser search | present | 0 | 1 | 0 | — |
| 53 | 🇺🇸 | Poe | | AI chat aggregation | present | 0 | 1 | 0 | — |