petpost.ca
robots.txt

Robots Exclusion Standard data for petpost.ca

Resource Scan

Scan Details

Site Domain petpost.ca
Base Domain petpost.ca
Scan Status Ok
Last Scan2025-10-11T23:35:57+00:00
Next Scan 2025-10-18T23:35:57+00:00

Last Scan

Scanned2025-10-11T23:35:57+00:00
URL https://petpost.ca/robots.txt
Domain IPs 72.167.125.133
Response IP 72.167.125.133
Found Yes
Hash fabaa33c5f1a426a9babbe47b6f93202d22614ca3f8c670d95225da8b065451c
SimHash 4f4b58520577

Groups

*

Rule Path
Allow /
Disallow /plugins/
Disallow /libs/
Disallow /includes/
Disallow /print*
Disallow /*?sort_by=
Disallow /*%26sort_by%3D
Disallow /*?sort_type=
Disallow /*%26sort_type%3D
Disallow /*confirm.html*
Disallow /*listing-details.html*
Disallow /*print.html*
Disallow /404.html*
Disallow /*listing-remove.html*
Disallow /*pdf-export.html*
Disallow /*newsletter.html*

Other Records

Field Value
crawl-delay 20

Other Records

Field Value
sitemap https://www.petpost.ca/sitemap.xml

Comments

  • robots.txt
  • Rules generated by the Sitemap plugin
  • Excluded pages:

Warnings

  • `host` is not a known field.