papajohns.com.co
robots.txt

Robots Exclusion Standard data for papajohns.com.co

Resource Scan

Scan Details

Site Domain papajohns.com.co
Base Domain papajohns.com.co
Scan Status Ok
Last Scan2024-09-17T04:36:04+00:00
Next Scan 2024-10-17T04:36:04+00:00

Last Scan

Scanned2024-09-17T04:36:04+00:00
URL https://www.papajohns.com.co/robots.txt
Domain IPs 13.33.88.36, 13.33.88.4, 13.33.88.6, 13.33.88.68, 2600:9000:223b:1e00:9:b9ba:9940:93a1, 2600:9000:223b:2400:9:b9ba:9940:93a1, 2600:9000:223b:3400:9:b9ba:9940:93a1, 2600:9000:223b:6000:9:b9ba:9940:93a1, 2600:9000:223b:a00:9:b9ba:9940:93a1, 2600:9000:223b:b000:9:b9ba:9940:93a1, 2600:9000:223b:c400:9:b9ba:9940:93a1, 2600:9000:223b:f400:9:b9ba:9940:93a1
Response IP 13.33.88.36
Found Yes
Hash 2ef7bff6366dc9790735494b63ef1c6d5dbdea3a22fb2fd7665bd47789df3f08
SimHash e638cd074dd0

Groups

*

Rule Path
Disallow /img/*
Disallow /account/*
Disallow /login/*
Disallow /checkout/*
Disallow /busca/*
Disallow /quick-view/*
Disallow /espiar/*
Disallow /buscapagina/*

Other Records

Field Value
sitemap https://www.papajohns.com.co/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.