Academic & open research
The archive: every monitored site, its robots.txt history and AI-crawler stance.
| # | Company | Domain | Sector | robots.txt | AI bots blocked | Snapshots | Changes | Last change |
|---|---|---|---|---|---|---|---|---|
| 1 | 🇺🇸 arXiv | | Research repository | present | 0 | 1 | 0 | - |
| 2 | 🇺🇸 Google Scholar | | Academic search | present | 0 | 1 | 0 | - |
| 3 | 🇺🇸 PubMed / NCBI | | Research repository | present | 0 | 1 | 0 | - |
| 4 | 🇩🇪 ResearchGate | | Research network | present | 0 | 1 | 0 | - |
| 5 | 🇺🇸 Academia.edu | | Research network | present | 🤖 3 | 1 | 0 | - |
| 6 | 🇺🇸 Semantic Scholar | | Academic search | present | 0 | 1 | 0 | - |
| 7 | 🇺🇸 SSRN | | Preprint server | present | 🤖 3 | 1 | 0 | - |
| 8 | 🇺🇸 bioRxiv | | Preprint server | present | 🤖 7 | 1 | 0 | - |
| 9 | 🇺🇸 medRxiv | | Preprint server | present | 0 | 1 | 0 | - |
| 10 | 🇺🇸 OpenAlex | | Open catalog | blocked | 0 | 0 | 0 | - |
| 11 | 🇨🇭 Zenodo | | Research repository | present | 0 | 1 | 0 | - |
| 12 | 🇺🇸 Open Science Framework | | Research repository | present | 0 | 1 | 0 | - |
| 13 | 🇬🇧 CORE | | Research aggregator | present | 0 | 1 | 0 | - |
| 14 | 🇫🇷 HAL Open Archive | | Research repository | redirect | 0 | 0 | 0 | - |
| 15 | 🇬🇧 Europe PMC | | Research repository | present | 0 | 1 | 0 | - |
| 16 | 🇺🇸 Crossref | | Scholarly metadata | present | 0 | 1 | 0 | - |
| 17 | 🇬🇧 DOAJ | | Open access directory | present | 🤖 8 | 1 | 0 | - |
| 18 | 🇺🇸 PLOS | | Open access journals | present | 0 | 1 | 0 | - |
| 19 | 🇺🇸 ORCID | | Researcher identifiers | present | 0 | 1 | 0 | - |
| 20 | 🇩🇪 DBLP | | CS bibliography | present | 0 | 1 | 0 | - |