NeuralCrawl

🇺🇸 Malwarebytes

malwarebytes.com · Cybersecurity · rank #48 · Cybersecurity

AI crawler access (latest snapshot, 5h ago)

Legend: blocked · restricted · allowed · faded = inherited from the * wildcard group

GPTBot
ChatGPT-User
OAI-SearchBot
ClaudeBot
Claude-User
Claude-SearchBot
anthropic-ai
Claude-Web
CCBot
Google-Extended
Applebot-Extended
PerplexityBot
Perplexity-User
Bytespider
Amazonbot
FacebookBot
meta-externalagent
meta-externalfetcher
cohere-ai
AI2Bot
Diffbot
omgili
YouBot
DuckAssistBot
MistralAI-User
PanguBot
Timpibot

Current robots.txt · 1817 bytes · sha256 9054793fd920

User-agent: *
User-Agent: Googlebot
User-Agent: applebot
User-Agent: PerplexityBot
User-Agent: bingbot
User-Agent:  OAI-SearchBot
User-Agent:  GPTBot
User-Agent: GeminiBot
User-Agent: ClaudeBot

Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
Disallow: /*?wg-choose-original= 
DISALLOW: /*?x-clickref=
DISALLOW: /*?amp
DISALLOW: /*?d=
DISALLOW: /*?sid2=
DISALLOW: /*.pdf
DISALLOW: /ctrl/*
DISALLOW: /v1a/*
DISALLOW: /*.php
DISALLOW: /404
DISALLOW: /press/2008*
DISALLOW: /press/2009*
DISALLOW: /press/2010*
DISALLOW: /press/2011*
DISALLOW: /press/2012*
DISALLOW: /press/2013*
DISALLOW: /press/2014*
DISALLOW: /press/2015*
DISALLOW: /press/2016*
DISALLOW: /press/2017*
DISALLOW: /press/2018*
DISALLOW: /press/2019*
DISALLOW: /press/2020*
DISALLOW: /press/2021*
DISALLOW: /press/2022*
DISALLOW: /press/category/in-the-news*
DISALLOW: /business/solutions/gdpr/contact-us
DISALLOW: /education/contact/thankyou
DISALLOW: /eula/*
DISALLOW: /finance/contact/thankyou
DISALLOW: /lp/*
DISALLOW: /mac-download
DISALLOW: /pricing/inapp/*
DISALLOW: /secure
DISALLOW: /secure/guidelines
DISALLOW: /support/thirdpartynotices/*
DISALLOW: /thank_you*
DISALLOW: /thank-you*
DISALLOW: /threezerodays
Disallow: *.ttf?
DISALLOW: /mbam-whats-new.json*
Sitemap: https://www.malwarebytes.com/sitemap_index.xml
Sitemap: https://www.malwarebytes.com/pt-br/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/nl/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/es/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/de/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/fr/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/pl/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/it/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/ru/page-sitemap.xml
Sitemap: https://www.malwarebytes.com/pt/page-sitemap.xml
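
The file above places the * wildcard and several named crawlers in a single group, so every listed agent shares the same rule set, and any crawler not named there inherits it through *. The following is a minimal sketch of how such a group could be evaluated, assuming RFC 9309 semantics: blank lines do not end a group, the longest matching pattern wins, and a tie goes to Allow. The function names (parse_groups, rules_for, is_allowed) are hypothetical and are not NeuralCrawl's implementation; the input is a short excerpt of the archived file, not the full rule list.

```python
import re

# Excerpt of the archived robots.txt (assumption: a shortened copy for illustration).
ROBOTS_EXCERPT = """\
User-agent: *
User-Agent: GPTBot
User-Agent: ClaudeBot

Disallow: /wp-admin/
Allow: /wp-admin/admin-ajax.php
DISALLOW: /*.pdf
DISALLOW: /press/2022*
"""

def parse_groups(text):
    """Return a list of (agents, rules); each rule is an (is_allow, pattern) pair."""
    groups, agents, rules, in_rules = [], [], [], False
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()   # drop comments
        if ":" not in line:
            continue                          # blank lines do not close a group
        field, value = (part.strip() for part in line.split(":", 1))
        field = field.lower()                 # field names are case-insensitive
        if field == "user-agent":
            if in_rules:                      # a user-agent line after rules opens a new group
                groups.append((agents, rules))
                agents, rules, in_rules = [], [], False
            agents.append(value.lower())
        elif field in ("allow", "disallow") and agents:
            in_rules = True
            if value:                         # an empty pattern matches nothing
                rules.append((field == "allow", value))
    if agents:
        groups.append((agents, rules))
    return groups

def rules_for(groups, bot):
    """Return (rules, inherited); inherited is True when only the * group applies."""
    bot = bot.lower()
    if any(bot in agents for agents, _ in groups):
        return [r for agents, rs in groups if bot in agents for r in rs], False
    return [r for agents, rs in groups if "*" in agents for r in rs], True

def to_regex(pattern):
    """Translate a robots.txt pattern: * matches any run of characters, a trailing $ anchors the end."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    rx = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(rx + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Longest matching pattern wins; on equal length, Allow beats Disallow."""
    best = (0, True)                          # no matching rule means allowed
    for allow, pattern in rules:
        if to_regex(pattern).match(path):
            best = max(best, (len(pattern), allow))
    return best[1]

groups = parse_groups(ROBOTS_EXCERPT)
rules, inherited = rules_for(groups, "Claude-User")
print(inherited)                                      # True: not named, falls back to *
print(is_allowed(rules, "/wp-admin/admin-ajax.php"))  # True: the longer Allow wins
print(is_allowed(rules, "/press/2022/launch"))        # False
```

Note that Python's urllib.robotparser treats a blank line after User-agent lines as the end of a record and does not expand * inside paths, so it may read this file differently from the sketch above.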

Change history

  1. initial snapshot
    • First snapshot of robots.txt archived
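
A snapshot is identified above by its byte length and a short SHA-256 prefix. The following is a minimal sketch of how two snapshots could be compared on that basis, assuming the displayed hash is the first 12 hex characters of SHA-256 over the raw response bytes; that detail and the names fingerprint and has_changed are assumptions for illustration, not confirmed by the page.

```python
import hashlib

def fingerprint(body: bytes, prefix_len: int = 12):
    """Return (size in bytes, truncated SHA-256 hex digest) for a robots.txt body."""
    return len(body), hashlib.sha256(body).hexdigest()[:prefix_len]

def has_changed(previous: bytes, current: bytes) -> bool:
    """Treat two snapshots as different when their fingerprints differ."""
    return fingerprint(previous) != fingerprint(current)

old = b"User-agent: *\nDisallow: /wp-admin/\n"
new = b"User-agent: *\nDisallow: /wp-admin/\nDisallow: /lp/*\n"
print(fingerprint(old))
print(has_changed(old, new))  # True
```

A truncated digest keeps the display short; comparing full digests would be the safer choice when deciding whether to record a new history entry.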