notiecuador.com.ec
robots.txt

Robots Exclusion Standard data for notiecuador.com.ec

Resource Scan

Scan Details

Site Domain notiecuador.com.ec
Base Domain notiecuador.com.ec
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-06-05T20:25:41+00:00
Next Scan 2024-09-03T20:25:41+00:00

Last Successful Scan

Scanned2023-11-09T20:23:37+00:00
URL https://www.notiecuador.com.ec/robots.txt
Domain IPs 138.199.44.209, 185.103.37.72, 185.103.37.73, 51.210.0.138, 51.89.172.162
Response IP 185.103.37.72
Found Yes
Hash 196d4828576e4ad8989b76fa2f413faee2fb3b1f6313d3b885bd7b4012b44526
SimHash ea189276abba

Groups

*

Rule Path
Disallow /abonados/
Disallow /europa/
Disallow /europa2001/
Disallow /europa2003/
Disallow /boletin-joven-00135/
Disallow /rsc-00195/
Disallow /enviarnoticia.aspx
Disallow /enviatunoticia.aspx
Disallow /ws/AltaBoletin.ashx
Disallow /buscador.aspx

yandex

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

freshbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

heritrix

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

um-ic

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europapress.es/sitemap.xml
sitemap https://www.europapress.es/sitemap_index/