papajohns.com.gt
robots.txt

Robots Exclusion Standard data for papajohns.com.gt

Resource Scan

Scan Details

Site Domain papajohns.com.gt
Base Domain papajohns.com.gt
Scan Status Ok
Last Scan2024-05-21T02:16:15+00:00
Next Scan 2024-06-04T02:16:15+00:00

Last Scan

Scanned2024-05-21T02:16:15+00:00
URL https://papajohns.com.gt/robots.txt
Redirect https://www.papajohns.com.gt/robots.txt
Redirect Domain www.papajohns.com.gt
Redirect Base papajohns.com.gt
Domain IPs 13.35.18.56, 13.35.18.63, 13.35.18.65, 13.35.18.72
Redirect IPs 13.33.88.101, 13.33.88.24, 13.33.88.83, 13.33.88.91
Response IP 13.33.88.101
Found Yes
Hash 79763f13cd2f41edb3d068875abca27b455693466e93fea70bd084aec9862c65
SimHash 259018504e82

Groups

*

Rule Path
Disallow /recuperar-contrasena
Disallow /nps
Disallow /perfil
Disallow /checkout
Disallow /compra-exitosa
Disallow /compra-en-revision
Disallow /compra-rechazada

adsbot-google

Rule Path
Disallow

googlebot

Rule Path
Allow /*%3Dmaps*
Allow /*%3Dorganic*
Disallow /*category%3D*
Disallow /*gclid%3D*
Disallow /*%26utm*
Disallow /*p%3D*

*

Rule Path
Allow /*.css$
Allow /*.js$

Other Records

Field Value
sitemap https://www.papajohns.com.gt/sitemap.xml
sitemap https://www.papajohns.com.gt/sitemap-stores.xml
sitemap https://www.papajohns.com.gt/pizzas/sitemap.xml

Comments

  • robots.txt de https://www.papajohns.com.gt/
  • Previene problemas de recursos bloqueados