NeuralCrawl

Academic & open research

The archive: every monitored site, its robots.txt history, and its AI-crawler stance.

| # | Company | Domain | Sector | robots.txt | AI bots blocked | Snapshots | Changes | Last change |
|---|---------|--------|--------|------------|-----------------|-----------|---------|-------------|
| 1 | 🇺🇸 arXiv | arxiv.org | Research repository | present | 0 | 1 | 0 | - |
| 2 | 🇺🇸 Google Scholar | scholar.google.com | Academic search | present | 0 | 1 | 0 | - |
| 3 | 🇺🇸 PubMed / NCBI | ncbi.nlm.nih.gov | Research repository | present | 0 | 1 | 0 | - |
| 4 | 🇩🇪 ResearchGate | researchgate.net | Research network | present | 0 | 1 | 0 | - |
| 5 | 🇺🇸 Academia.edu | academia.edu | Research network | present | 🤖 3 | 1 | 0 | - |
| 6 | 🇺🇸 Semantic Scholar | semanticscholar.org | Academic search | present | 0 | 1 | 0 | - |
| 7 | 🇺🇸 SSRN | ssrn.com | Preprint server | present | 🤖 3 | 1 | 0 | - |
| 8 | 🇺🇸 bioRxiv | biorxiv.org | Preprint server | present | 🤖 7 | 1 | 0 | - |
| 9 | 🇺🇸 medRxiv | medrxiv.org | Preprint server | present | 0 | 1 | 0 | - |
| 10 | 🇺🇸 OpenAlex | openalex.org | Open catalog | blocked | 0 | 0 | 0 | - |
| 11 | 🇨🇭 Zenodo | zenodo.org | Research repository | present | 0 | 1 | 0 | - |
| 12 | 🇺🇸 Open Science Framework | osf.io | Research repository | present | 0 | 1 | 0 | - |
| 13 | 🇬🇧 CORE | core.ac.uk | Research aggregator | present | 0 | 1 | 0 | - |
| 14 | 🇫🇷 HAL Open Archive | hal.science | Research repository | redirect | 0 | 0 | 0 | - |
| 15 | 🇬🇧 Europe PMC | europepmc.org | Research repository | present | 0 | 1 | 0 | - |
| 16 | 🇺🇸 Crossref | crossref.org | Scholarly metadata | present | 0 | 1 | 0 | - |
| 17 | 🇬🇧 DOAJ | doaj.org | Open access directory | present | 🤖 8 | 1 | 0 | - |
| 18 | 🇺🇸 PLOS | plos.org | Open access journals | present | 0 | 1 | 0 | - |
| 19 | 🇺🇸 ORCID | orcid.org | Researcher identifiers | present | 0 | 1 | 0 | - |
| 20 | 🇩🇪 DBLP | dblp.org | CS bibliography | present | 0 | 1 | 0 | - |