NeuralCrawl

Massachusetts Institute of Technology / robots.txt snapshot

fetched 2026-06-26T14:15:22Z · HTTP 200 · 199 bytes · sha256 bad4e8ead1b0414e

final URL: https://web.mit.edu/robots.txt

User-agent: *
Disallow: /afs/
Disallow: /cgi-bin/
Disallow: /user/
Disallow: /org/
Disallow: /activity/
Disallow: /contrib/
Disallow: /dept/
Disallow: /software/
Disallow: /bin/
Disallow: */Public/*
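
To illustrate how the rules above might be applied, here is a minimal sketch of a path checker. It assumes RFC 9309 matching semantics (patterns match from the start of the path, `*` matches any run of characters, a trailing `$` anchors the end), which is one common interpretation and not necessarily how any particular crawler treats this file. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed`, and the sample paths, are hypothetical and not part of the snapshot.

```python
import re

# Disallow patterns copied from the snapshot above (User-agent: *).
DISALLOW = [
    "/afs/", "/cgi-bin/", "/user/", "/org/", "/activity/",
    "/contrib/", "/dept/", "/software/", "/bin/", "*/Public/*",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    Assumption: RFC 9309 semantics, where '*' matches any run of
    characters and a trailing '$' anchors the end of the path.
    Everything else is treated as a literal.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow pattern matches from the start of `path`."""
    # re.Pattern.match anchors at the beginning of the string,
    # which gives the prefix-match behaviour assumed here.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical sample paths, chosen only to exercise the rules.
    for sample in ["/afs/athena/", "/research/", "/people/alice/Public/notes.txt"]:
        print(sample, is_disallowed(sample))
```

Under these assumptions, the last rule blocks any path containing a `/Public/` segment at any depth, while the other rules only block paths that begin with the listed prefix.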