m.europapress.es
robots.txt

Robots Exclusion Standard data for m.europapress.es

Resource Scan

Scan Details

Site Domain m.europapress.es
Base Domain europapress.es
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-03-25T02:01:46+00:00
Next Scan 2024-06-23T02:01:46+00:00

Last Successful Scan

Scanned 2023-11-04T01:46:32+00:00
URL https://m.europapress.es/robots.txt
Redirect https://www.europapress.es/robots.txt
Redirect Domain www.europapress.es
Redirect Base europapress.es
Domain IPs 138.199.8.193, 143.244.35.226, 51.81.243.73, 51.81.66.107
Redirect IPs 138.199.44.209, 185.103.37.72, 185.103.37.73, 51.210.0.138, 51.89.172.162
Response IP 185.103.37.73
Found Yes
Hash 196d4828576e4ad8989b76fa2f413faee2fb3b1f6313d3b885bd7b4012b44526
SimHash ea189276abba

Groups

*

Rule Path
Disallow /abonados/
Disallow /europa/
Disallow /europa2001/
Disallow /europa2003/
Disallow /boletin-joven-00135/
Disallow /rsc-00195/
Disallow /enviarnoticia.aspx
Disallow /enviatunoticia.aspx
Disallow /ws/AltaBoletin.ashx
Disallow /buscador.aspx

yandex

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

freshbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

heritrix

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

um-ic

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europapress.es/sitemap.xml
sitemap https://www.europapress.es/sitemap_index/
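
The groups listed above can be checked against a crawler's user agent with Python's standard `urllib.robotparser`. The following is a minimal sketch, not the site's actual file: the robots.txt text is rebuilt from the records in this report, so its exact wording, casing, and ordering are assumptions, and `ExampleBot` is a hypothetical user agent used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumption: these values are copied from the report; the text built from
# them is a reconstruction, not the original robots.txt.
DEFAULT_DISALLOWS = [
    "/abonados/", "/europa/", "/europa2001/", "/europa2003/",
    "/boletin-joven-00135/", "/rsc-00195/", "/enviarnoticia.aspx",
    "/enviatunoticia.aspx", "/ws/AltaBoletin.ashx", "/buscador.aspx",
]
BLOCKED_EVERYWHERE = [
    "yandex", "mail.ru", "freshbot", "istellabot",
    "ahrefsbot", "trendictionbot", "sentibot",
]
CRAWL_DELAYS = {"grapeshot": 5, "heritrix": 5, "mj12bot": 5, "um-ic": 1}
SITEMAPS = [
    "https://www.europapress.es/sitemap.xml",
    "https://www.europapress.es/sitemap_index/",
]


def build_robots_txt() -> str:
    """Rebuild a robots.txt equivalent to the groups listed in the report."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in DEFAULT_DISALLOWS]
    for agent in BLOCKED_EVERYWHERE:
        lines += ["", f"User-agent: {agent}", "Disallow: /"]
    for agent, delay in CRAWL_DELAYS.items():
        lines += ["", f"User-agent: {agent}", f"Crawl-delay: {delay}"]
    lines += [""] + [f"Sitemap: {url}" for url in SITEMAPS]
    return "\n".join(lines)


parser = RobotFileParser()
parser.parse(build_robots_txt().splitlines())

base = "https://www.europapress.es"
# A crawler with no group of its own falls back to the "*" group.
print(parser.can_fetch("ExampleBot", base + "/"))
print(parser.can_fetch("ExampleBot", base + "/buscador.aspx"))
# A crawler with its own group ignores the "*" group entirely.
print(parser.can_fetch("yandex", base + "/"))
print(parser.can_fetch("grapeshot", base + "/buscador.aspx"))
print(parser.crawl_delay("um-ic"))
print(parser.site_maps())  # requires Python 3.8 or later
```

Under this parser, a named group replaces the `*` group rather than adding to it, which is why the crawl-delay-only groups are reported as "All paths allowed" even for paths the `*` group disallows. Other parsers may resolve groups differently, so the results should be read as one interpretation of the rules.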