papajohns.com.co
robots.txt

Robots Exclusion Standard data for papajohns.com.co

Resource Scan

Scan Details

Site Domain papajohns.com.co
Base Domain papajohns.com.co
Scan Status Ok
Last Scan2024-11-16T06:22:53+00:00
Next Scan 2024-12-16T06:22:53+00:00

Last Scan

Scanned2024-11-16T06:22:53+00:00
URL https://www.papajohns.com.co/robots.txt
Domain IPs 13.33.88.36, 13.33.88.4, 13.33.88.6, 13.33.88.68, 2600:9000:223b:2200:9:b9ba:9940:93a1, 2600:9000:223b:2c00:9:b9ba:9940:93a1, 2600:9000:223b:8c00:9:b9ba:9940:93a1, 2600:9000:223b:9a00:9:b9ba:9940:93a1, 2600:9000:223b:b400:9:b9ba:9940:93a1, 2600:9000:223b:d400:9:b9ba:9940:93a1, 2600:9000:223b:f000:9:b9ba:9940:93a1, 2600:9000:223b:fe00:9:b9ba:9940:93a1
Response IP 13.33.88.4
Found Yes
Hash 2ef7bff6366dc9790735494b63ef1c6d5dbdea3a22fb2fd7665bd47789df3f08
SimHash e638cd074dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*
Disallow /buscapagina/*

Other Records

Field Value
sitemap https://www.papajohns.com.co/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.