NeuralCrawl

🇺🇸 Reuters

reuters.com · Publishers · rank #7 · Newswire · live robots.txt ↗

AI crawler access (latest snapshot, 13h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt · 3700 bytes · sha256 e95fb3c36e6b

User-agent: AASA-Bot
User-agent: ADmantX
User-agent: AdsBot-Google
User-agent: AdsBot-Google-Mobile
User-agent: AmazonAdBot
User-agent: Amzn-User
User-agent: Applebot
User-agent: AppleNewsBot
User-agent: Bingbot
User-agent: BingPreview
User-agent: BrightEdge
User-agent: BrightEdgeOnCrawl
User-agent: CensysInspect
User-agent: ChatGPT-User
User-agent: Cision
User-agent: Clickagy
User-agent: Concert
User-agent: ContextualBot
User-agent: CriteoBot
User-agent: DatadogSynthetics
User-agent: datadome-pageprotect-scanner
User-agent: Discordbot
User-agent: doubleverify
User-agent: DuckDuckBot
User-agent: ElevenlabsBot
User-agent: Embedly 
User-agent: facebookexternalhit
User-agent: Googlebot
User-agent: Googlebot Smartphone
User-agent: Googlebot-News
User-agent: Google-Display-Ads-Bot
User-agent: Google-InspectionTool
User-agent: GoogleOther
User-agent: Google-Read-Aloud
User-agent: Google-Safety
User-agent: Google-Site-Verification
User-agent: GTmetrix
User-agent: GumGumBot
User-agent: ias_crawler
User-agent: Iframely
User-agent: leiki
User-agent: LinkedInBot
User-agent: LinkTiger
User-agent: Mantisbot
User-agent: Mediapartners-Google
User-agent: meta-externalads
User-agent: meta-webindexer
User-agent: MicrosoftPreview
User-agent: MJ12bot 
User-agent: Moreover
User-agent: msnbot 
User-agent: NFBNewslineRobot
User-agent: OAI-SearchBot
User-agent: Oncrawl
User-agent: Opebot-v
User-agent: Opoint
User-agent: Optimizer
User-agent: outbrain 
User-agent: Pinterestbot
User-agent: Prerender
User-agent: Proofpoint 
User-agent: proximic
User-agent: PubMatic Crawler Bot
User-agent: Quantcastbot
User-agent: Qwantbot
User-agent: Reuters SEO Screaming Frog Spider 007
User-agent: Reuters-NAUWI
User-agent: Scom-Crawler-For-Reuters
User-agent: SinceraSyntheticUser
User-agent: Slurp
User-agent: SmartologyBot
User-agent: snews
User-agent: SocialFlow
User-agent: StatusCake
User-agent: Storebot-Google
User-agent: Stripebot
User-agent: TTD-Content
User-agent: Twitterbot
User-agent: URLDefense
User-agent: Verity
User-agent: vuln_scan_by_trustedsite_com_halo_security
User-agent: WISEbot
User-agent: Xenu Link Sleuth
User-agent: Yahoo Link Preview
User-agent: Yahoo! JAPAN
User-agent: YahooMailProxy
Disallow: /finance/stocks/option
Disallow: /finance/stocks/financialHighlights
Disallow: /search
Disallow: /site-search/
Disallow: /beta
Disallow: /designtech
Disallow: /featured-optimize
Disallow: /energy-test
Disallow: /article/beta
Disallow: /sponsored/previewcampaign
Disallow: /sponsored/previewarticle
Disallow: /test/
Disallow: /news/archive/commentary
Disallow: /brandfeatures/venture-capital
Disallow: /assets/siteindex
Disallow: /article/api/
Disallow: /practical-law-the-journal/search/
Disallow: /pf/api/
Disallow: /fr/
Disallow: /it/
Disallow: /es/
Disallow: /pt/
Disallow: /de/
Disallow: /latam/
Disallow: /account/subscribe/payment/

# Block all other bots
User-agent: *
Allow: /plus/
Disallow: /

SITEMAP: https://www.reuters.com/arc/outboundfeeds/sitemap-index/?outputType=xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/news-sitemap-index/?outputType=xml
SITEMAP: https://www.reuters.com/plus/sitemap-index.xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/sitemap-plj-index/?outputType=xml
SITEMAP: https://www.reuters.com/graphics/sitemap.xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/sitemap-index/pictures/?outputType=xml
SITEMAP: https://www.reuters.com/static/video-sitemap/us/sitemap_video_index.xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/topic-sitemap/?outputType=xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/author-sitemap/?outputType=xml
SITEMAP: https://www.reuters.com/arc/outboundfeeds/pressrelease-sitemap/?outputType=xml
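The file above has two groups: a long list of named agents that share one set of Disallow rules, and a wildcard group that blocks everything except /plus/. The sketch below shows one way to check how a given crawler falls into those groups, using Python's standard `urllib.robotparser`. The excerpt is abbreviated to two named agents and two rules, and the helper names (`build_parser`, `access_report`) are illustrative, not part of any tool referenced on this page. Python's matcher uses substring comparison on agent names and first-match rule order, so other parsers may classify some agents differently.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated excerpt of the robots.txt above: two of the named agents,
# two of the shared Disallow rules, and the wildcard group.
ROBOTS_EXCERPT = """\
User-agent: ChatGPT-User
User-agent: OAI-SearchBot
Disallow: /search
Disallow: /pf/api/

# Block all other bots
User-agent: *
Allow: /plus/
Disallow: /
"""

BASE = "https://www.reuters.com"


def build_parser(text):
    """Parse robots.txt text without touching the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def access_report(parser, agents, urls):
    """Map each (agent, url) pair to True (may fetch) or False."""
    return {(a, u): parser.can_fetch(a, u) for a in agents for u in urls}


parser = build_parser(ROBOTS_EXCERPT)
report = access_report(
    parser,
    agents=["ChatGPT-User", "GPTBot"],
    urls=[BASE + "/world/", BASE + "/search", BASE + "/plus/"],
)

for (agent, url), allowed in report.items():
    print(f"{agent:14} {url:36} {'allowed' if allowed else 'blocked'}")
```

In this excerpt ChatGPT-User is named explicitly, so only the listed paths are off limits to it, while GPTBot is not named and inherits the wildcard group.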

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived
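A change history like this one can be built by fingerprinting each fetched copy of robots.txt and comparing it with the previous snapshot. The sketch below is a minimal illustration that assumes the fingerprint is the byte length plus the first 12 hex characters of the SHA-256 digest, matching the form shown in the snapshot header. How the site actually computes or stores its snapshots is not stated, and the function names here are hypothetical.

```python
import hashlib
from typing import Tuple


def fingerprint(body: bytes, prefix_len: int = 12) -> Tuple[int, str]:
    """Return (size in bytes, truncated SHA-256 hex digest) for a robots.txt body."""
    digest = hashlib.sha256(body).hexdigest()
    return len(body), digest[:prefix_len]


def has_changed(previous: Tuple[int, str], current: Tuple[int, str]) -> bool:
    """Treat any difference in size or digest prefix as a new snapshot."""
    return previous != current


# Two hypothetical snapshots that differ by one rule.
old_body = b"User-agent: *\nDisallow: /\n"
new_body = b"User-agent: *\nAllow: /plus/\nDisallow: /\n"

old_fp = fingerprint(old_body)
new_fp = fingerprint(new_body)

print("old:", old_fp)
print("new:", new_fp)
print("changed:", has_changed(old_fp, new_fp))
```

A truncated digest is convenient for display but raises the chance of collisions, so a tracker would likely store the full digest and shorten it only when rendering.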

Editorial profile: content bias, reliability & geopolitical trust

Content bias: Center · Credibility: High reliability

Content bias (political lean) and credibility (factual track record) are third-party/aggregated assessments (source: AllSides/MBFC) and may be contested.