
Robots Exclusion Standard data for jornada.com.mx

Resource Scan

Scan Details

Site Domain jornada.com.mx
Base Domain jornada.com.mx
Scan Status Ok
Last Scan 2024-11-10T17:11:51+00:00
Next Scan 2024-11-17T17:11:51+00:00

Last Scan

Scanned 2024-11-10T17:11:51+00:00
URL https://jornada.com.mx/robots.txt
Redirect https://www.jornada.com.mx/robots.txt
Redirect Domain www.jornada.com.mx
Redirect Base jornada.com.mx
Domain IPs 64.31.38.83
Redirect IPs 104.22.2.54, 104.22.3.54, 172.67.21.145, 2606:4700:10::6816:236, 2606:4700:10::6816:336, 2606:4700:10::ac43:1591
Response IP 104.22.3.54
Found Yes
Hash 1fc80a21dfa3992588c367b46b03f9655c69d39a3ba1153d184b35ddb221c61e
SimHash 5c083f17adb1

Groups

*

Rule Path
Disallow

*

Rule Path
Allow /ads.txt
Disallow /ads

grapeshot

Rule Path
Disallow

*

Rule Path Comment
Disallow /texto/ do not archive the text version
Disallow /pda/ do not archive the PDA version
Disallow /nuevo/palm/ do not archive the PDA version
Disallow /ultimas/search do not perform searches
Disallow /ultimas/search_form do not perform searches
Disallow /cupones-descuento/coupons/ -
Disallow /cupones-descuento/coupons -
Disallow /cupones-descuento/dispatch-shop/ -
Disallow /cupones-descuento/dispatch-coupon/ -
Disallow /cupones-descuento/*?page= -
Disallow /cupones-descuento/shops/*/ratings -
Disallow /cupones-descuento/coupons/*/ratings -
Disallow /cupones-descuento/admin/ -
Disallow /cupones-descuento/admin -
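To illustrate how the groups above might be evaluated for a given path, here is a minimal sketch. It assumes the three `*` groups are merged into one rule list and that the longest matching pattern wins, with Allow winning ties, as described in RFC 9309. Individual crawlers may behave differently. The names `STAR_RULES`, `GRAPESHOT_RULES`, and `is_allowed` are hypothetical and are not part of the scan data.

```python
import re

# Rules transcribed from the three "*" groups in the scan, merged into one list.
# An empty pattern (the bare "Disallow" row) matches nothing.
STAR_RULES = [
    ("disallow", ""),
    ("allow", "/ads.txt"),
    ("disallow", "/ads"),
    ("disallow", "/texto/"),
    ("disallow", "/pda/"),
    ("disallow", "/nuevo/palm/"),
    ("disallow", "/ultimas/search"),
    ("disallow", "/ultimas/search_form"),
    ("disallow", "/cupones-descuento/coupons/"),
    ("disallow", "/cupones-descuento/coupons"),
    ("disallow", "/cupones-descuento/dispatch-shop/"),
    ("disallow", "/cupones-descuento/dispatch-coupon/"),
    ("disallow", "/cupones-descuento/*?page="),
    ("disallow", "/cupones-descuento/shops/*/ratings"),
    ("disallow", "/cupones-descuento/coupons/*/ratings"),
    ("disallow", "/cupones-descuento/admin/"),
    ("disallow", "/cupones-descuento/admin"),
]

# The "grapeshot" group has a single empty Disallow, so nothing is blocked.
GRAPESHOT_RULES = [("disallow", "")]


def _pattern_to_regex(pattern):
    """Translate a robots.txt path pattern: '*' is a wildcard, a trailing '$' anchors the end."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules):
    """Return True if `path` may be fetched under `rules` (longest match wins)."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # empty pattern matches nothing
        if _pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            # Prefer the longer pattern; on equal length prefer Allow.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed
```

Under these assumptions, `/ads.txt` is fetchable because the Allow pattern is longer than `/ads`, while other paths starting with `/ads` are blocked for generic crawlers. The `grapeshot` agent is not restricted at all.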

Other Records

Field Value
sitemap https://www.jornada.com.mx/services/sitemap.xml