Robots Exclusion Standard data for petpost.ca
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | petpost.ca |
Base Domain | petpost.ca |
Scan Status | Ok |
Last Scan | 2025-10-11T23:35:57+00:00 |
Next Scan | 2025-10-18T23:35:57+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-10-11T23:35:57+00:00 |
URL | https://petpost.ca/robots.txt |
Domain IPs | 72.167.125.133 |
Response IP | 72.167.125.133 |
Found | Yes |
Hash | fabaa33c5f1a426a9babbe47b6f93202d22614ca3f8c670d95225da8b065451c |
SimHash | 4f4b58520577 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /plugins/ |
Disallow | /libs/ |
Disallow | /includes/ |
Disallow | /print* |
Disallow | /*?sort_by= |
Disallow | /*%26sort_by%3D |
Disallow | /*?sort_type= |
Disallow | /*%26sort_type%3D |
Disallow | /*confirm.html* |
Disallow | /*listing-details.html* |
Disallow | /*print.html* |
Disallow | /404.html* |
Disallow | /*listing-remove.html* |
Disallow | /*pdf-export.html* |
Disallow | /*newsletter.html* |
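
To illustrate how the rules in the `*` group might be evaluated, here is a minimal sketch in Python. It assumes the longest-match precedence described in RFC 9309 (the most specific matching pattern wins, and `Allow` wins a tie) and treats `*` as "any run of characters". The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and the sketch skips percent-encoding normalization, so the `%26`/`%3D` patterns are matched literally. Individual crawlers may behave differently.

```python
import re

# Rules copied from the "*" group above, as (rule, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/plugins/"),
    ("disallow", "/libs/"),
    ("disallow", "/includes/"),
    ("disallow", "/print*"),
    ("disallow", "/*?sort_by="),
    ("disallow", "/*%26sort_by%3D"),
    ("disallow", "/*?sort_type="),
    ("disallow", "/*%26sort_type%3D"),
    ("disallow", "/*confirm.html*"),
    ("disallow", "/*listing-details.html*"),
    ("disallow", "/*print.html*"),
    ("disallow", "/404.html*"),
    ("disallow", "/*listing-remove.html*"),
    ("disallow", "/*pdf-export.html*"),
    ("disallow", "/*newsletter.html*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally, as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for rule, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = rule == "allow"
            # Longer pattern wins; on equal length, Allow wins.
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                best_allow = is_allow
    return best_allow


for path in ["/", "/dogs.html", "/plugins/gallery.js",
             "/dogs.html?sort_by=price", "/printers.html"]:
    print(path, is_allowed(path))
```

Under these assumptions, `/print*` is a plain prefix match, so it also blocks unrelated paths such as `/printers.html`, while `Allow: /` only takes effect for paths that no longer `Disallow` pattern matches.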
Other Records
Field | Value |
---|---|
crawl-delay | 20 |
Other Records
Field | Value |
---|---|
sitemap | https://www.petpost.ca/sitemap.xml |
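
As a rough sketch of how a crawler could read the `crawl-delay` and `sitemap` records, the example below uses Python's standard `urllib.robotparser` (`site_maps()` requires Python 3.8 or later). The `ROBOTS_TXT` text is a partial reconstruction from this report, not the file as served, and the `ExampleBot` user agent and `fetch_offsets` helper are hypothetical. Note that `crawl-delay` is not part of RFC 9309, so support varies between crawlers.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the relevant records; the real file has more rules.
ROBOTS_TXT = """\
User-agent: *
Allow: /
Crawl-delay: 20

Sitemap: https://www.petpost.ca/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# "ExampleBot" is a made-up agent name; it falls back to the "*" group.
delay = parser.crawl_delay("ExampleBot")
sitemaps = parser.site_maps()


def fetch_offsets(paths, delay_seconds):
    """Pair each path with the earliest offset (in seconds) at which to request it."""
    return [(i * delay_seconds, path) for i, path in enumerate(paths)]


print(delay, sitemaps)
print(fetch_offsets(["/a", "/b", "/c"], delay))
```

With a 20-second delay, a crawler that honors the record would need at least three requests per minute's worth of spacing, which is worth keeping in mind when estimating how long a full crawl of the sitemap would take.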
Warnings
- `host` is not a known field.