NeuralCrawl

๐Ÿ‡ง๐Ÿ‡ท Uol

uol.com.br · Top 1000 websites · rank #134 · Web · live robots.txt ↗

AI crawler access (latest snapshot, 1h ago)

blocked restricted allowed faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt 2470 bytes · sha256 0410a02c1c3d · raw

# robots.txt
#
User-agent: *
Sitemap: https://www.uol.com.br/carros/sitemap/v2/news-01.xml
Sitemap: https://www.uol.com.br/ecoa/sitemap/news-01.xml
Sitemap: https://www.uol.com.br/esporte/sitemap/v2/news-01.xml
Sitemap: https://www.uol.com.br/nossa/sitemap/news-01.xml
Sitemap: https://www.uol.com.br/splash/sitemap/news-01.xml
Sitemap: https://www.uol.com.br/tilt/sitemap/news-01.xml
Sitemap: https://www.uol.com.br/universa/sitemap/v2/news-01.xml
Sitemap: https://www.uol.com.br/vivabem/sitemap/v2/news-01.xml
Sitemap: https://c.jsuol.com.br/assets/jupiter-news/?resource-id=sitemap&source=toca/index.xml
Allow: /
Disallow: /carros/dev/
User-agent: Google-Extended
Disallow: /
Allow: /.ghtm$
User-agent: AI2Bot
Disallow: /
User-agent: Amazonbot
Disallow: /
User-agent: Anthropic-ai
Disallow: /
User-agent: Applebot
Disallow: /
User-agent: Applebot-Extended
Disallow: /
User-agent: AwarioRssBot
Disallow: /
User-agent: AwarioSmartBot
Disallow: /
User-agent: Bytespider
Disallow: /
User-agent: CCBot
Disallow: /
User-agent: Claude-SearchBot
Disallow: /
User-agent: Claude-User
Disallow: /
User-agent: Claude-Web
Disallow: /
User-agent: ClaudeBot
Disallow: /
User-agent: Cohere-ai
Disallow: /
User-agent: cohere-training-data-crawler
Disallow: /
User-agent: Cotoyogi
Disallow: /
User-agent: DataForSeoBot
Disallow: /
User-agent: Datenbank Crawler
Disallow: /
User-agent: DeepSeek
Disallow: /
User-agent: Devin
Disallow: /
User-agent: Diffbot
Disallow: /
User-agent: DuckAssistBot
Disallow: /
User-agent: FacebookBot
Disallow: /
User-agent: Factset_spyderbot
Disallow: /
User-agent: Gemini-Deep-Research
Disallow: /
User-agent: GoogleAgent-Mariner
Disallow: /
User-agent: ICC-Crawler
Disallow: /
User-agent: ImagesiftBot
Disallow: /
User-agent: Kangaroo Bot
Disallow: /
User-agent: Meta-ExternalAgent
Disallow: /
User-agent: Meta-ExternalFetcher
Disallow: /
User-agent: MistralAI-User
Disallow: /
User-agent: netEstate Imprint Crawler
Disallow: /
User-agent: NovaAct
Disallow: /
User-agent: omgili
Disallow: /
User-agent: Omgilibot
Disallow: /
User-agent: Operator
Disallow: /
User-agent: PanguBot
Disallow: /
User-agent: Peer39_crawler/1.0
Disallow: /
User-agent: Perplexity-User
Disallow: /
User-agent: PerplexityBot
Disallow: /
User-agent: PetalBot
Disallow: /
User-agent: Qwen
Disallow: /
User-agent: SemrushBot-OCOB
Disallow: /
User-agent: Timpibot
Disallow: /
User-agent: Twitterbot
Disallow: /
User-agent: Webzio-Extended
Disallow: /
User-agent: YouBot
Disallow: /

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived