eltribunodejujuy.com
robots.txt

Robots Exclusion Standard data for eltribunodejujuy.com

Resource Scan

Scan Details

Site Domain eltribunodejujuy.com
Base Domain eltribunodejujuy.com
Scan Status Ok
Last Scan 2024-10-08T15:16:51+00:00
Next Scan 2024-10-15T15:16:51+00:00

Last Scan

Scanned 2024-10-08T15:16:51+00:00
URL https://eltribunodejujuy.com/robots.txt
Domain IPs 104.21.82.85, 172.67.155.110, 2606:4700:3037::6815:5255, 2606:4700:3037::ac43:9b6e
Response IP 104.21.82.85
Found Yes
Hash dfe2413f96b2efef9c4aaf880d51289376ffffb7a52beb5ff5c3364f2d3862e7
SimHash 00009a112570

Groups

*

Rule Path
Allow /
Disallow /partido_detalle/
Disallow /portadas/
Disallow /b/
Disallow /suscripcion_web/login
Disallow /suscripcion_web/mi-cuenta
Disallow /suscripcion_web/register
Disallow /suscripcion_web/cerrar-sesion-ok
Disallow /*.pdf$
Disallow /*?utm_*
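
The rules in the * group combine a blanket Allow with narrower Disallow paths and two wildcard patterns. The following is a minimal sketch of how a crawler could evaluate them, assuming the longest-match precedence and the `*` / `$` special characters described in RFC 9309. The rule list is copied from the table above; the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan report. Python's standard `urllib.robotparser` is not used here because it does plain prefix matching without wildcard support.

```python
import re

# Rules of the "*" group, copied from the report above.
RULES = [
    ("allow", "/"),
    ("disallow", "/partido_detalle/"),
    ("disallow", "/portadas/"),
    ("disallow", "/b/"),
    ("disallow", "/suscripcion_web/login"),
    ("disallow", "/suscripcion_web/mi-cuenta"),
    ("disallow", "/suscripcion_web/register"),
    ("disallow", "/suscripcion_web/cerrar-sesion-ok"),
    ("disallow", "/*.pdf$"),
    ("disallow", "/*?utm_*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` (including any query string) may be crawled.

    Assumed precedence: the matching rule with the longest pattern wins,
    and Allow wins a tie. With no matching rule, crawling is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed("/deportes/nota.html"))      # only "Allow /" matches
print(is_allowed("/portadas/2024.jpg"))       # blocked by /portadas/
print(is_allowed("/docs/informe.pdf"))        # blocked by /*.pdf$
print(is_allowed("/nota?utm_source=social"))  # blocked by /*?utm_*
```

The example paths are invented for illustration. Under these assumptions only the first one is crawlable; the other three hit a Disallow pattern that is longer than the one-character Allow.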

google-extended

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

piplbot

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /
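
The eight named groups above each carry a single "Disallow /", which blocks the whole site for those user agents. As a rough illustration, the sketch below rebuilds just these groups (plus a simplified "Allow /" default) and queries them with Python's standard `urllib.robotparser`. The robots.txt lines are reconstructed from this report, not copied from the live file, and `BLOCKED_AGENTS`, `build_parser`, and the agent name `ExampleBot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# User agents that the report lists with a blanket "Disallow /".
BLOCKED_AGENTS = [
    "google-extended",
    "ccbot",
    "gptbot",
    "chatgpt-user",
    "piplbot",
    "omgilibot",
    "ia_archiver",
    "magpie-crawler",
]


def build_parser():
    """Build a parser from robots.txt lines reconstructed from the report."""
    # Simplified default group: the real "*" group has further Disallow rules.
    lines = ["User-agent: *", "Allow: /", ""]
    for agent in BLOCKED_AGENTS:
        lines += [f"User-agent: {agent}", "Disallow: /", ""]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


parser = build_parser()
home = "https://eltribunodejujuy.com/"
print(parser.can_fetch("GPTBot", home))      # False: matches the gptbot group
print(parser.can_fetch("ExampleBot", home))  # True: falls back to the * group
```

Agent names are compared case-insensitively, so "GPTBot" matches the "gptbot" group shown in the report.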

Other Records

Field Value
sitemap https://eltribunodejujuy.com/sitemap.xml
sitemap https://eltribunodejujuy.com/sitemap_lite.xml
sitemap https://eltribunodejujuy.com/sitemap-news.xml
sitemap https://eltribunodejujuy.com/sitemap-organico.xml
sitemap https://eltribunodejujuy.com/sitemap-secciones.xml

Comments

  • robots.txt file for https://eltribunodejujuy.com *
  • Last updated: 21/05/2024