expansion.com
robots.txt

Robots Exclusion Standard data for expansion.com

Resource Scan

Scan Details

Site Domain expansion.com
Base Domain expansion.com
Scan Status Ok
Last Scan2024-09-20T18:13:24+00:00
Next Scan 2024-09-27T18:13:24+00:00

Last Scan

Scanned2024-09-20T18:13:24+00:00
URL https://expansion.com/robots.txt
Redirect https://www.expansion.com/robots.txt
Redirect Domain www.expansion.com
Redirect Base expansion.com
Domain IPs 2001:67c:2294:1000::f199, 34.90.247.117
Redirect IPs 199.232.193.50, 199.232.197.50
Response IP 151.101.41.50
Found Yes
Hash 1b06a32398fa9aac08c22268b3870273c975becdb6a25344255e4dc1188ecbe6
SimHash b3be73c0c9f4

Groups

*

Rule Path
Disallow /s/
Disallow /registro/*
Disallow /buscador/*
Disallow /avisolegal/index.html
Disallow /usuario/panelcontrol/AtencionCliente
Disallow /ed/*
Disallow /especiales/philips*
Disallow /especiales/cepsa/2015/11/27/5656f5efca4741354a8b456c.html
Disallow /economia-digital/2015/12/02/565ee35de2704e08348b4579.html
Disallow /2013/05/24/valencia/1369421699.html
Disallow /accesible/2013/04/24/catalunya/1366824807.html
Disallow /2013/04/24/catalunya/1366824807.html
Disallow /2013/04/22/juridico/1366657780.html
Disallow /juridico/sentencias/2015/12/28/56816af522601dd7178b459c.html
Disallow /2015/02/03/juridico/1422988139.html
Disallow /accesible/2011/11/18/catalunya/1321648436.html
Disallow /2011/12/09/catalunya/1323418073.html
Disallow /ejecutivo-administrador/fernandez-jambrina-alicia_2265029_G50.html
Disallow /blogs/peon-de-dama/2012/11/15/bosques-naturales-la-ultima-estafa.html
Disallow /agencia/efe/2012/07/30/17492097.html
Disallow /2009/04/28/empresas/1240930114.html
Disallow /accesible/2011/12/09/catalunya/1323418073.html
Disallow /2014/10/31/juridico/1414757650.html
Disallow /2011/11/18/catalunya/1321648436.html
Disallow /2008/04/16/empresas/energia/1112875.html
Disallow /2012/08/08/empresas/1344440327.html
Disallow /apw.js*
Disallow *zonadescargas/obtenerDocumento.html?codigo=*
Disallow /ultima_hora/index.html?year=*
Disallow /edicion_impresa/calendario.html?month=*
Disallow /includes/calendarios/calendarioRadar.html?year*
Disallow /opinion/documentos/hemeroteca.html?*

addthis.com

Rule Path
Disallow /

admantx

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

bender

Rule Path
Disallow /

bixocrawler

Rule Path
Disallow /

bl.uk_lddc_bot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

cyberalert

Rule Path
Disallow /

digext

Rule Path
Disallow /

discobot

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

dloader

Rule Path
Disallow /

dloader(naverrobot)

Rule Path
Disallow /

doc

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

dts agent

Rule Path
Disallow /

exabot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

fairshare

Rule Path
Disallow /

fetch

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

genieo

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

heritrix/3.3.0

Rule Path
Disallow /

httrack

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

integromedb

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kimengi

Rule Path
Disallow /

kimengi/nineconnections.com

Rule Path
Disallow /

larbin

Rule Path
Disallow /

lexxebot/1.0

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linko

Rule Path
Disallow /

livelapbot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

metauri

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

moreover

Rule Path
Disallow /

moreoverbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

nabot

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

netseer crawler

Rule Path
Disallow /

newscan

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

oai-searchbot

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

proximic

Rule Path
Disallow /

psbot

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

sitebot/0.1

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

slurp

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

umbot-ln

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

universalfeedparser

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wesee:search

Rule Path
Disallow /

wget

Rule Path
Disallow /

wotbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

xenu

Rule Path
Disallow /

yasni

Rule Path
Disallow /

zao

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Comments

  • Diciembre 2020
  • Bloqueo de bots y crawlers poco útiles

Warnings

  • 2 invalid lines.