primicias.ec
robots.txt

Robots Exclusion Standard data for primicias.ec

Resource Scan

Scan Details

Site Domain primicias.ec
Base Domain primicias.ec
Scan Status Ok
Last Scan2024-11-16T16:23:23+00:00
Next Scan 2024-11-23T16:23:23+00:00

Last Scan

Scanned2024-11-16T16:23:23+00:00
URL https://primicias.ec/robots.txt
Redirect https://www.primicias.ec/robots.txt
Redirect Domain www.primicias.ec
Redirect Base primicias.ec
Domain IPs 13.226.2.40, 13.226.2.62, 13.226.2.64, 13.226.2.81, 2600:9000:21f8:1600:c:970c:3c0:93a1, 2600:9000:21f8:2e00:c:970c:3c0:93a1, 2600:9000:21f8:6400:c:970c:3c0:93a1, 2600:9000:21f8:9a00:c:970c:3c0:93a1, 2600:9000:21f8:b200:c:970c:3c0:93a1, 2600:9000:21f8:b600:c:970c:3c0:93a1, 2600:9000:21f8:ee00:c:970c:3c0:93a1, 2600:9000:21f8:fa00:c:970c:3c0:93a1
Redirect IPs 18.161.111.110, 18.161.111.35, 18.161.111.69, 18.161.111.73, 2600:9000:23d1:1a00:c:970c:3c0:93a1, 2600:9000:23d1:600:c:970c:3c0:93a1, 2600:9000:23d1:6e00:c:970c:3c0:93a1, 2600:9000:23d1:800:c:970c:3c0:93a1, 2600:9000:23d1:8200:c:970c:3c0:93a1, 2600:9000:23d1:9200:c:970c:3c0:93a1, 2600:9000:23d1:9c00:c:970c:3c0:93a1, 2600:9000:23d1:f600:c:970c:3c0:93a1
Response IP 108.156.22.41
Found Yes
Hash 8abd7fb01c34c48624600cb47fc698aca3044c3950f4ff6057917a1b49d9829f
SimHash a5985913cc75

Groups

noxtrumbot
msnbot
slurp
webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

msiecrawler

Rule Path
Allow /

microsoft.url.control

Rule Path
Allow /

bingbot

Rule Path
Allow /

twitterbot

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.primicias.ec/sitemap-index.xml
sitemap https://www.primicias.ec/sitemap-google-news.xml

Comments

  • Ralentizamos algunos bots que se suelen volver locos
  • Crawl-delay: 20
  • Crawl-delay: 20
  • Crawl-delay: 20
  • Bloqueo de bots y crawlers poco utiles
  • Index Bingbot
  • sitemaps