m.20minutos.es
robots.txt

Robots Exclusion Standard data for m.20minutos.es

Resource Scan

Scan Details

Site Domain m.20minutos.es
Base Domain 20minutos.es
Scan Status Ok
Last Scan2024-11-10T15:41:42+00:00
Next Scan 2024-12-10T15:41:42+00:00

Last Scan

Scanned2024-11-10T15:41:42+00:00
URL https://m.20minutos.es/robots.txt
Redirect https://www.20minutos.es/robots.txt
Redirect Domain www.20minutos.es
Redirect Base 20minutos.es
Domain IPs 3.164.182.125, 3.164.182.17, 3.164.182.21, 3.164.182.22
Redirect IPs 3.164.182.125, 3.164.182.17, 3.164.182.21, 3.164.182.22
Response IP 18.165.140.104
Found Yes
Hash 40d9084f68a17ae75a7d9c1ecf0b28b6d49b8b92986f789c19000b7e8e82f4fe
SimHash ba9e510b8066

Groups

*

Rule Path
Disallow /view/
Disallow /view/*
Disallow /buscar
Disallow /busqueda/
Disallow /busqueda/*
Disallow /imprimir/
Disallow /mini20
Disallow /mini20/
Disallow /home
Disallow /home/
Disallow /img_validator/
Disallow /aviso_comentario/
Disallow /enviar_amigo/
Disallow /usuarios/
Disallow /proc/
Disallow /iphoneapp
Disallow /iphoneapp/
Disallow /ajax
Disallow /compartir
Disallow /sso/
Disallow /especial/especial-de-prueba/
Disallow /especial/pruebas-comercial/
Disallow /widgets/
Disallow /boletin/baja/
Disallow /archivo/
Disallow /archivo/*
Disallow /*.woff2$
Disallow /*.ttf$

proximic

Rule Path
Disallow

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

flamingo_searchengine

Rule Path
Disallow /

exabot

Rule Path
Disallow /

moreoverbot

Rule Path
Disallow /

cyberalert

Rule Path
Disallow /

newscan

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

lexxebot/1.0

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

sitebot/0.1

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

netseer crawler

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

discobot

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

psbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

bl.uk_lddc_bot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

bender

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

yasni

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

integromedb

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

wesee:search

Rule Path
Disallow /

admantx

Rule Path
Disallow /

spbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

omgili

Rule Path
Disallow /

diffbot

Rule Path
Disallow /

timpibot

Rule Path
Disallow /

webzio-extended

Rule Path
Disallow /

Comments

  • Agentes permitidos explicitamente
  • Agentes bloqueados por idioma
  • Agentes nocivos
  • AI data scrappers

Warnings

  • 2 invalid lines.