hn.cl
robots.txt

Robots Exclusion Standard data for hn.cl

Resource Scan

Scan Details

Site Domain hn.cl
Base Domain hn.cl
Scan Status Ok
Last Scan 2024-10-03T16:29:20+00:00
Next Scan 2024-11-02T16:29:20+00:00

Last Scan

Scanned 2024-10-03T16:29:20+00:00
URL https://hn.cl/robots.txt
Domain IPs 190.110.123.69
Response IP 190.110.123.69
Found Yes
Hash c40f9366c330235add863dbea1ef5c5fe53c370fc2f83b9b079815af1a3411c2
SimHash 6090584006b2

Groups

*

Rule Path
Disallow /search
Disallow /cgi-bin
Disallow /directorio/
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
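
The `*` group combines a broad Disallow on /wp-admin/ with a narrower Allow for /wp-admin/admin-ajax.php. The sketch below shows one way these five rules could be evaluated, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). The `RULES` list and the `is_allowed` helper are illustrative names, not part of the scan data, and parsers that apply the first matching rule in file order can give a different answer for the admin-ajax.php path.

```python
# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("disallow", "/search"),
    ("disallow", "/cgi-bin"),
    ("disallow", "/directorio/"),
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Hypothetical helper: plain prefix matching only, which is enough here
    because none of the listed rules use wildcards.
    """
    best_kind, best_len = "allow", -1  # no matching rule means allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        longer = len(prefix) > best_len
        tie_prefers_allow = len(prefix) == best_len and kind == "allow"
        if longer or tie_prefers_allow:
            best_kind, best_len = kind, len(prefix)
    return best_kind == "allow"


if __name__ == "__main__":
    for path in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/search?q=x", "/"):
        print(path, is_allowed(path))
```

Under this reading, /wp-admin/admin-ajax.php stays crawlable while the rest of /wp-admin/ is blocked. Note that /directorio/ has a trailing slash, so the bare path /directorio does not match that rule.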

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

gurujibot

Rule Path
Disallow /

hl_ftien_spider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yeti

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.hn.cl/sitemap_index.xml
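
The groups and the sitemap record above can be checked against a crawler name with Python's standard `urllib.robotparser`. This is a rough sketch under stated assumptions: the robots.txt text is reconstructed from the parsed groups in this report (the original file's exact layout and ordering are not shown), and `ExampleBot` is a made-up agent name used to exercise the `*` group. Because `urllib.robotparser` applies rules in file order rather than by longest match, the sketch avoids querying the /wp-admin/admin-ajax.php exception.

```python
from urllib.robotparser import RobotFileParser

# Agents that the report lists with a single "Disallow /" rule.
BLOCKED_AGENTS = [
    "msiecrawler", "webcopier", "httrack", "microsoft.url.control",
    "libwww", "baiduspider", "gurujibot", "hl_ftien_spider",
    "sogou spider", "yeti", "yodaobot",
]


def build_robots_txt():
    """Rebuild an approximate robots.txt from the groups in this report.

    Assumption: group order and blank-line layout are guesses; only the
    agents, rules, and sitemap URL come from the scan.
    """
    lines = [
        "User-agent: *",
        "Disallow: /search",
        "Disallow: /cgi-bin",
        "Disallow: /directorio/",
        "Disallow: /wp-admin/",
        "Allow: /wp-admin/admin-ajax.php",
        "",
    ]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    lines.append("Sitemap: https://www.hn.cl/sitemap_index.xml")
    return lines


parser = RobotFileParser()
parser.parse(build_robots_txt())

# A listed agent is blocked from the whole site (matching is case-insensitive).
print(parser.can_fetch("HTTrack", "https://hn.cl/"))           # False
# An unlisted agent (hypothetical name) falls back to the "*" group.
print(parser.can_fetch("ExampleBot", "https://hn.cl/"))        # True
print(parser.can_fetch("ExampleBot", "https://hn.cl/search"))  # False
# Sitemap records are exposed separately (Python 3.8+).
print(parser.site_maps())  # ['https://www.hn.cl/sitemap_index.xml']
```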

Comments

  • List of blocked bots