NeuralCrawl

Open Science Framework / robots.txt snapshot

← back to osf.io · fetched 2026-06-26T14:15:22Z (4h ago) · HTTP 200 · 255 bytes · sha256 55143ee8903b3975 · raw

final URL: https://osf.io/robots.txt

1# www.robotstxt.org/
2
3User-agent: *
4Disallow: /api/*
5Disallow: *?view_only=
6crawl-delay: 10
7
8# Robots that have misbehaved
9User-agent: PingBot
10User-agent: PerplexityBot
11User-agent: GPTBot
12User-agent: BaiduSpider
13User-agent: Meta-ExternalAgent
14Disallow: *