vistaprevia.elcorreoweb.es
robots.txt

Robots Exclusion Standard data for vistaprevia.elcorreoweb.es

Resource Scan

Scan Details

Site Domain vistaprevia.elcorreoweb.es
Base Domain elcorreoweb.es
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-06-06T09:03:39+00:00
Next Scan 2024-09-04T09:03:39+00:00

Last Successful Scan

Scanned2024-01-16T09:00:48+00:00
URL https://vistaprevia.elcorreoweb.es/robots.txt
Redirect https://elcorreoweb.es/robots.txt
Redirect Domain elcorreoweb.es
Redirect Base elcorreoweb.es
Domain IPs 89.140.110.20
Redirect IPs 89.140.110.20
Response IP 89.140.110.20
Found Yes
Hash 45b1e1a9b81347445d1c1932a915a66a7b4cceac96443960ae41acf00472a1ac
SimHash ba5f4284c833

Groups

*

Rule Path
Disallow /news-portlet/metalocator/
Disallow /news-portlet/html/teaser-viewer-portlet/teaser_page.jsp
Disallow /news-portlet/html/teaser-viewer-portlet/teaser_filter.jsp
Disallow /news-portlet/filterteaser/
Disallow /news-portlet/getfilteropts/
Disallow /tracking-portlet/html/ranking-viewer/ranking_details.jsp
Disallow /user-portlet/login-with/
Disallow /user-portlet/edit-user-profile/
Disallow /user-portlet/reset-credentials/
Disallow /user-portlet/confirm-email/
Disallow /user-portlet/refreshuserentitlements/
Disallow /user-portlet/getEntitlements/
Disallow /group/
Disallow /user/
Disallow /web/
Disallow /image/
Disallow /busqueda/

*

Rule Path
Disallow /error404
Disallow /pagina-en-construccion
Disallow /servicios/html
Disallow /lab-protec

ia_archiver
ubicrawler
doc
zao
sitecheck.internetseer.com
zealbot
msiecrawler
sitesnagger
webstripper
webcopier
fetch
offline explorer
teleport
teleportpro
webzip
linko
httrack
microsoft.url.control
xenu
larbin
libwww
zyborg
download ninja
slurp
maxthon
cncdialer
-grub-client
ia_archiver
ia_archiver-web.archive.org
k2spider
libwww
wget
adequat
adequat-systems
amisoftware
ask n read
augure
auramundi
coexel
converacrawler
corporama
digimind
ellisphere
eureka
eureka.cc
europresse
kbcrawl
knowings
leadbox
linkfluence
manageo
mediacompil
meltwater
mention
moreover
mytwip
newsnow
newzbin
opinion-tracker
proxem
qwam content intelligence
score3
sindup
societe.com
spotter
synthesio
trendeo
trendybuzz
vecteurplus
verif
verticalsearch
vsw
winello
compspybot
curious george
cybeye.com
docomo
exb language crawler
ezooms
flamingo_searchengine
genieo
genio
lwnutch
lexxebot
openwebindex
rediffnewsbot
seoengworldbot
scanmine
screaming frog seo spider
shopwiki
showyoubot
sosospider
wocbot
yeti
yeti
youdaobot
daumoa
gsa-crawler
libcrawl
linkdex
magpie-crawler
repparser
rogerbot
sindice-site-manager
sogou spider
sogou
woriobot
yacybot
yolinkbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://elcorreoweb.es/sitemap.xml
sitemap https://elcorreoweb.es/sitemapforgoogle.xml
sitemap http://elcorreoweb.es/wp-content/megasitemap-index.xml

Comments

  • Agentes nocivos conocidos
  • Todos estos agentes especificamente prohibidos

Warnings

  • 1 invalid line.