Oncrawl / robots.txt snapshot
← back to oncrawl.com · fetched 2026-06-20T14:56:27Z (4h ago) · HTTP 200 · 744 bytes · sha256 2ee140d5bb36a02e · raw
final URL: https://oncrawl.com/robots.txt
| 1 | User-agent: Googlebot |
| 2 | Disallow: /ads.txt |
| 3 | Disallow: /wpcb/wp-admin/ |
| 4 | Disallow: /search/ |
| 5 | Disallow: /*&sa=* |
| 6 | Disallow: /&sa=* |
| 7 | Disallow: /?s=* |
| 8 | Disallow: /.well-known* |
| 9 | Disallow: /wpcb/wp-json/contact-form* |
| 10 | Disallow: /wpcb/wp-includes/wlwmanifest.xml |
| 11 | Disallow: /web/app/modules/* |
| 12 | Disallow: /?author=* |
| 13 | Disallow: /*feed/ |
| 14 | Disallow: /*hubs* |
| 15 | Allow: /tag/content-hubs/ |
| 16 | |
| 17 | User-agent: * |
| 18 | Disallow: /ads.txt |
| 19 | Disallow: /wpcb/wp-admin/ |
| 20 | Disallow: /search/ |
| 21 | Disallow: /*&sa=* |
| 22 | Disallow: /&sa=* |
| 23 | Disallow: /?s=* |
| 24 | Disallow: /.well-known* |
| 25 | Disallow: /wpcb/wp-json/contact-form* |
| 26 | Disallow: /wpcb/wp-includes/wlwmanifest.xml |
| 27 | Disallow: /web/app/modules/* |
| 28 | Disallow: /?author=* |
| 29 | |
| 30 | |
| 31 | Sitemap: https://www.oncrawl.com/sitemap_index.xml |
| 32 | Sitemap: https://fr.oncrawl.com/sitemap_index.xml |