NeuralCrawl

πŸ‡ΊπŸ‡Έ Harvard University

harvard.edu · Universities · rank #4 · University · live robots.txt ↗

AI crawler access (latest snapshot, 3h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 749 bytes · sha256 534d2068c615 · raw

# START YOAST BLOCK
# ---------------------------
User-agent: *
Disallow:

Sitemap: https://www.harvard.edu/sitemap_index.xml
Sitemap: https://www.harvard.edu/president/sitemap_index.xml
Sitemap: https://www.harvard.edu/media-relations/sitemap_index.xml
Sitemap: https://www.harvard.edu/climate-and-sustainability/sitemap_index.xml
Sitemap: https://www.harvard.edu/wellbeing/sitemap_index.xml
Sitemap: https://www.harvard.edu/guidelines/sitemap_index.xml
Sitemap: https://www.harvard.edu/ai/sitemap_index.xml
Sitemap: https://www.harvard.edu/community/sitemap_index.xml
Sitemap: https://legacyofslavery.harvard.edu/sitemap_index.xml
Sitemap: https://www.harvard.edu/federal-lawsuits/sitemap_index.xml
# ---------------------------
# END YOAST BLOCK

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived