notichile.cl
robots.txt

Robots Exclusion Standard data for notichile.cl

Resource Scan

Scan Details

Site Domain notichile.cl
Base Domain notichile.cl
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-03-02T14:54:38+00:00
Next Scan 2024-05-31T14:54:38+00:00

Last Successful Scan

Scanned 2023-11-03T23:16:13+00:00
URL https://notichile.cl/robots.txt
Redirect https://www.notichile.cl/robots.txt
Redirect Domain www.notichile.cl
Redirect Base notichile.cl
Domain IPs 138.199.44.209, 185.103.37.72, 185.103.37.73, 51.210.0.138, 51.89.172.162
Redirect IPs 138.199.44.209, 185.103.37.72, 185.103.37.73, 51.210.0.138, 51.89.172.162
Response IP 185.103.37.73
Found Yes
Hash 196d4828576e4ad8989b76fa2f413faee2fb3b1f6313d3b885bd7b4012b44526
SimHash ea189276abba

Groups

*

Rule Path
Disallow /abonados/
Disallow /europa/
Disallow /europa2001/
Disallow /europa2003/
Disallow /boletin-joven-00135/
Disallow /rsc-00195/
Disallow /enviarnoticia.aspx
Disallow /enviatunoticia.aspx
Disallow /ws/AltaBoletin.ashx
Disallow /buscador.aspx

yandex

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

freshbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

heritrix

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

um-ic

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europapress.es/sitemap.xml
sitemap https://www.europapress.es/sitemap_index/
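
The groups listed above can be checked programmatically. The following is a minimal sketch using Python's standard urllib.robotparser. The ROBOTS_TXT string is a partial reconstruction from the listing above, not the file as it was fetched, and "ExampleBot" is a hypothetical user agent standing in for any crawler that has no group of its own.

```python
from urllib.robotparser import RobotFileParser

# Partial reconstruction from the groups listed above (an assumption:
# the original file's exact text, ordering, and casing are not shown here).
ROBOTS_TXT = """\
User-agent: *
Disallow: /abonados/
Disallow: /buscador.aspx

User-agent: yandex
Disallow: /

User-agent: mj12bot
Crawl-delay: 5

User-agent: um-ic
Crawl-delay: 1
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt content held in memory, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)
base = "https://www.notichile.cl"

# "ExampleBot" is hypothetical; it has no group, so the "*" group applies.
for agent, path in [
    ("ExampleBot", "/abonados/x"),
    ("ExampleBot", "/"),
    ("yandex", "/"),
    ("mj12bot", "/abonados/x"),
]:
    print(agent, path, parser.can_fetch(agent, base + path))

# Crawl-delay is reported per group; agents without one get None.
for agent in ("mj12bot", "um-ic", "ExampleBot"):
    print(agent, parser.crawl_delay(agent))
```

In this parser a crawler that matches a named group uses only that group, so mj12bot, whose group defines a crawl-delay but no rules, is not bound by the paths disallowed under "*". Other crawlers and parsers may interpret crawl-delay differently or ignore it.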