NeuralCrawl

Google Scholar / robots.txt snapshot

fetched 2026-06-26T14:15:22Z · HTTP 200 · 564 bytes · sha256 15a55fbdcbb3aaa7

final URL: https://scholar.google.com/robots.txt

User-agent: *
Disallow: /search
Disallow: /index.html
Disallow: /scholar
Disallow: /citations?
Allow: /citations?user=
Disallow: /citations?*cstart=
Disallow: /citations?user=*%40
Disallow: /citations?user=*@
Allow: /citations?view_op=list_classic_articles
Allow: /citations?view_op=mandates_leaderboard
Allow: /citations?view_op=metrics_intro
Allow: /citations?view_op=new_profile
Allow: /citations?view_op=sitemap
Allow: /citations?view_op=top_venues

User-agent: Twitterbot
Disallow:

User-agent: facebookexternalhit
Disallow:

User-agent: PetalBot
Disallow: /
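
The Allow and Disallow lines in the `User-agent: *` group overlap, so the result for a given path depends on which rule takes precedence. The following is a minimal sketch of one way to evaluate them. It assumes longest-match precedence with `*` wildcards as described in RFC 9309, with Allow winning ties. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, percent-encoding normalization is skipped, and a particular crawler may resolve these rules differently.

```python
import re

# Rules copied from the "User-agent: *" group of the snapshot.
RULES = [
    ("disallow", "/search"),
    ("disallow", "/index.html"),
    ("disallow", "/scholar"),
    ("disallow", "/citations?"),
    ("allow", "/citations?user="),
    ("disallow", "/citations?*cstart="),
    ("disallow", "/citations?user=*%40"),
    ("disallow", "/citations?user=*@"),
    ("allow", "/citations?view_op=list_classic_articles"),
    ("allow", "/citations?view_op=mandates_leaderboard"),
    ("allow", "/citations?view_op=metrics_intro"),
    ("allow", "/citations?view_op=new_profile"),
    ("allow", "/citations?view_op=sitemap"),
    ("allow", "/citations?view_op=top_venues"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (path plus query string) may be fetched.

    Assumption: the longest matching pattern wins, and Allow wins a tie.
    A path that matches no rule is allowed.
    """
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, verdict = length, kind == "allow"
    return verdict


if __name__ == "__main__":
    for p in ["/citations?user=abc",
              "/citations?user=abc&cstart=20",
              "/scholar?q=robots"]:
        print(p, is_allowed(p))
```

Under these assumptions, a plain profile URL such as `/citations?user=abc` is allowed because `/citations?user=` is longer than `/citations?`, while adding `cstart=` to the same URL makes it disallowed because `/citations?*cstart=` is longer still. The Twitterbot and facebookexternalhit groups have an empty Disallow, which permits everything, and the PetalBot group disallows every path.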