rpp-noticias.io
robots.txt

Robots Exclusion Standard data for rpp-noticias.io

Resource Scan

Scan Details

Site Domain rpp-noticias.io
Base Domain rpp-noticias.io
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-03-19T09:13:07+00:00
Next Scan 2024-06-17T09:13:07+00:00

Last Successful Scan

Scanned 2022-10-31T02:25:52+00:00
URL http://www.rpp-noticias.io/robots.txt
Redirect https://rpp.pe/robots.txt
Redirect Domain rpp.pe
Redirect Base rpp.pe
Response IP 65.9.66.66, 65.9.66.49, 65.9.66.70, 65.9.66.108
Found Yes
Hash 4d74f5fbab20611cd6e69b32bcc76c4310bbe627f4190f54936c09e09818504e
SimHash a016faa28f57

Groups

*
googlebot
googlebot-news
googlebot-image
googlebot-video
googlebot-mobile

Rule Path
Disallow /buscar/
Disallow /contactenos/*
Disallow /archivo/*/19*
Disallow /archivo/*/200*
Disallow /archivo/*/2010*
Disallow /archivo/*/2011*
Disallow /archivo/*/2012*
Disallow /archivo/*/2013*
Disallow /archivo/*/2014*
Disallow /archivo/*/2015*
Disallow /archivo/*/2016*
Disallow /archivo/*/2017*
Disallow /archivo/*/2018*
Disallow /archivo/19*
Disallow /archivo/200*
Disallow /archivo/2010*
Disallow /archivo/2011*
Disallow /archivo/2012*
Disallow /archivo/2013*
Disallow /archivo/2014*
Disallow /archivo/2015*
Disallow /archivo/2016*
Disallow /archivo/2017*
Disallow /archivo/2018*
Allow /sitemap/
Allow /sitemap/*
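
The Disallow paths above rely on the `*` wildcard. As a rough illustration of how a crawler might evaluate them, here is a minimal sketch, assuming the common convention that `*` matches any run of characters, the longest matching pattern wins, and Allow wins a tie. The helper names (`pattern_to_regex`, `is_allowed`) and the sample request paths are hypothetical and do not come from the scan; only a subset of the listed rules is included.

```python
import re

# A subset of the rules listed above for the "*" / googlebot group.
RULES = [
    ("disallow", "/buscar/"),
    ("disallow", "/contactenos/*"),
    ("disallow", "/archivo/*/19*"),
    ("disallow", "/archivo/*/2015*"),
    ("disallow", "/archivo/2015*"),
    ("allow", "/sitemap/"),
    ("allow", "/sitemap/*"),
]

def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any sequence of characters; a trailing '$' anchors the end.
    """
    anchored = path_pattern.endswith("$")
    core = path_pattern[:-1] if anchored else path_pattern
    body = ".*".join(re.escape(part) for part in core.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if not pattern:
            continue  # an empty pattern matches nothing
        if pattern_to_regex(pattern).match(path):
            key = (len(pattern), directive == "allow")
            if best is None or key > best:
                best = key
    return True if best is None else best[1]

if __name__ == "__main__":
    # Hypothetical request paths, for illustration only.
    for p in ["/archivo/politica/2015-03-01", "/sitemap/news", "/buscar/?q=x"]:
        print(p, is_allowed(RULES, p))
```

Under these assumptions, archive paths for the listed years are blocked while anything under /sitemap/ stays fetchable.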

petalbot

Rule Path
Allow

ia_archiver

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

slurp

Rule Path
Disallow /

maxthon

Rule Path
Disallow /

cncdialer

Rule Path
Disallow /
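
Each group above applies only to the crawlers it names, with `*` as the fallback. A minimal sketch of that selection step follows, assuming a simplified case-insensitive substring match on the user-agent string and a preference for the longest matching token. The function name `select_group`, the `GROUPS` layout, and the sample user-agent strings are hypothetical; only a few of the listed groups are included.

```python
# A few of the groups listed above, keyed by lowercase agent token.
GROUPS = {
    "*": [("disallow", "/buscar/")],
    "googlebot": [("disallow", "/buscar/")],
    "googlebot-news": [("disallow", "/buscar/")],
    "petalbot": [("allow", "")],
    "wget": [("disallow", "/")],
    "httrack": [("disallow", "/")],
}

def select_group_name(groups, user_agent):
    """Return the token of the group that applies to `user_agent`.

    Assumption: a token applies when it appears anywhere in the
    lowercased user-agent string; the longest such token wins,
    and "*" is used when no named token matches.
    """
    ua = user_agent.lower()
    matches = [token for token in groups if token != "*" and token in ua]
    if matches:
        return max(matches, key=len)
    return "*"

def select_group(groups, user_agent):
    """Return the rule list for the group that applies to `user_agent`."""
    return groups.get(select_group_name(groups, user_agent), [])

if __name__ == "__main__":
    # Hypothetical user-agent strings, for illustration only.
    for ua in ["Wget/1.21", "Mozilla/5.0 (compatible; Googlebot-News)", "ExampleBot/1.0"]:
        print(ua, "->", select_group_name(GROUPS, ua))
```

Substring matching is a deliberate simplification: short tokens in the list, such as `doc` or `fetch`, would match many unrelated user-agent strings, so a stricter crawler would compare against the product token only.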

Other Records

Field Value
sitemap https://rpp.pe/sitemap/news?ns_source=organico
sitemap https://rpp.pe/sitemap/news/politica?ns_source=organico
sitemap https://rpp.pe/sitemap/news/lima?ns_source=organico
sitemap https://rpp.pe/sitemap/news/mundo?ns_source=organico
sitemap https://rpp.pe/sitemap/news/futbol?ns_source=organico
sitemap https://rpp.pe/sitemap/news/economia?ns_source=organico
sitemap https://rpp.pe/sitemap/news/tecnologia?ns_source=organico
sitemap https://rpp.pe/sitemap/news/mundo?ns_source=organico
sitemap https://rpp.pe/sitemap/news/famosos?ns_source=organico
sitemap https://rpp.pe/sitemap/news/peru?ns_source=organico
sitemap https://rpp.pe/sitemap/news/cine?ns_source=organico
sitemap https://rpp.pe/sitemap/news/capital
sitemap https://rpp.pe/sitemap
sitemap https://rpp.pe/sitemap/web/politica?ns_source=organico
sitemap https://rpp.pe/sitemap/web/mundo?ns_source=organico
sitemap https://rpp.pe/sitemap/web/tecnologia?ns_source=organico
sitemap https://rpp.pe/sitemap/web/economia?ns_source=organico
sitemap https://rpp.pe/sitemap/web/futbol?ns_source=organico
sitemap https://rpp.pe/sitemap/web/famosos?ns_source=organico
sitemap https://rpp.pe/sitemap/web/cine?ns_source=organico
sitemap https://rpp.pe/sitemap/web/lima?ns_source=organico
sitemap https://rpp.pe/sitemap/web/peru?ns_source=organico
sitemap https://rpp.pe/sitemap/web/capital

Comments

  • Known harmful agents