noticiargentina.com.ar
robots.txt

Robots Exclusion Standard data for noticiargentina.com.ar

Resource Scan

Scan Details

Site Domain noticiargentina.com.ar
Base Domain noticiargentina.com.ar
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-05-31T12:30:11+00:00
Next Scan 2024-08-29T12:30:11+00:00

Last Successful Scan

Scanned 2023-11-03T21:34:56+00:00
URL https://noticiargentina.com.ar/robots.txt
Redirect https://www.noticiargentina.com.ar/robots.txt
Redirect Domain www.noticiargentina.com.ar
Redirect Base noticiargentina.com.ar
Domain IPs 138.199.44.209, 185.103.37.72, 185.103.37.73, 51.210.0.138, 51.89.172.162
Redirect IPs 138.199.44.209, 185.103.37.72, 185.103.37.73, 51.210.0.138, 51.89.172.162
Response IP 185.103.37.72
Found Yes
Hash 196d4828576e4ad8989b76fa2f413faee2fb3b1f6313d3b885bd7b4012b44526
SimHash ea189276abba

Groups

*

Rule Path
Disallow /abonados/
Disallow /europa/
Disallow /europa2001/
Disallow /europa2003/
Disallow /boletin-joven-00135/
Disallow /rsc-00195/
Disallow /enviarnoticia.aspx
Disallow /enviatunoticia.aspx
Disallow /ws/AltaBoletin.ashx
Disallow /buscador.aspx

yandex

Rule Path
Disallow /

mail.ru

Rule Path
Disallow /

freshbot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

sentibot

Rule Path
Disallow /

grapeshot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

heritrix

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

um-ic

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.europapress.es/sitemap.xml
sitemap https://www.europapress.es/sitemap_index/
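
The groups listed above can be checked against sample requests with Python's standard urllib.robotparser module. The following is a minimal sketch, not the scanner's own logic: the robots.txt text is rebuilt from the records in this report (the original file's exact layout is not shown here, so the ordering and spacing are assumptions), and the user agent "ExampleBot" and the path "/some-article.html" are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Records copied from the report; the file layout built from them is assumed.
STAR_DISALLOWS = [
    "/abonados/", "/europa/", "/europa2001/", "/europa2003/",
    "/boletin-joven-00135/", "/rsc-00195/", "/enviarnoticia.aspx",
    "/enviatunoticia.aspx", "/ws/AltaBoletin.ashx", "/buscador.aspx",
]
BLOCKED_AGENTS = [
    "yandex", "mail.ru", "freshbot", "istellabot",
    "ahrefsbot", "trendictionbot", "sentibot",
]
CRAWL_DELAYS = {"grapeshot": 5, "heritrix": 5, "mj12bot": 5, "um-ic": 1}
SITEMAPS = [
    "https://www.europapress.es/sitemap.xml",
    "https://www.europapress.es/sitemap_index/",
]


def build_robots_lines():
    """Rebuild a robots.txt equivalent to the groups in the report."""
    lines = ["User-agent: *"]
    lines += [f"Disallow: {path}" for path in STAR_DISALLOWS]
    lines.append("")
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    for agent, delay in CRAWL_DELAYS.items():
        lines += [f"User-agent: {agent}", f"Crawl-delay: {delay}", ""]
    lines += [f"Sitemap: {url}" for url in SITEMAPS]
    return lines


parser = RobotFileParser()
parser.parse(build_robots_lines())  # parse from memory, no network access

BASE = "https://www.noticiargentina.com.ar"


def allowed(agent, path):
    """Return True if `agent` may fetch `path` under the rebuilt rules."""
    return parser.can_fetch(agent, BASE + path)


# A crawler with no group of its own falls back to the "*" group.
print(allowed("ExampleBot", "/abonados/"))          # False
print(allowed("ExampleBot", "/some-article.html"))  # True
# A fully blocked crawler is denied everywhere.
print(allowed("AhrefsBot", "/"))                    # False
# A crawler with its own group uses only that group, not "*".
print(allowed("MJ12bot", "/abonados/"))             # True
print(parser.crawl_delay("MJ12bot"))                # 5
print(parser.site_maps())                           # requires Python 3.8+
```

Note that a crawler matching its own group (for example mj12bot) is not bound by the "*" rules, which is why the report lists "All paths allowed" for groups that only set a crawl-delay.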