sg.puma.com
robots.txt
Robots Exclusion Standard data for sg.puma.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | sg.puma.com |
Base Domain | puma.com |
Scan Status | Ok |
Last Scan | 2024-09-15T02:21:54+00:00 |
Next Scan | 2024-10-15T02:21:54+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-15T02:21:54+00:00 |
URL | https://sg.puma.com/robots.txt |
Domain IPs | 104.18.244.89, 104.18.245.89 |
Response IP | 104.18.245.89 |
Found | Yes |
Hash | ab0dde637c87d0455e79c92ea4a793839183a76712497de4441e458ab22edd74 |
SimHash | 235bc804effb |
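The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the scan does not state which algorithm it uses or exactly which bytes it hashes. As a rough sketch under that assumption, a digest of the fetched robots.txt body could be compared between scans to detect changes. The function name `body_digest` and both sample bodies are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical bodies from two scans; real input would be the fetched bytes.
previous = b"User-agent: *\nDisallow: /cgi-bin/\n"
current = b"User-agent: *\nDisallow: /cgi-bin/\nDisallow: /fr_FR/\n"

# Any byte-level difference between the two bodies changes the digest.
changed = body_digest(previous) != body_digest(current)
```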
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /fr_FR/ |
Disallow | /*pmin* |
Disallow | /*pmax* |
Disallow | /*prefn1* |
Disallow | /*prefn2* |
Disallow | /*prefn3* |
Disallow | /*prefn4* |
Disallow | /*prefv1* |
Disallow | /*prefv2* |
Disallow | /*prefv3* |
Disallow | /*prefv4* |
Disallow | *Wishlist-* |
Disallow | *Cart-Show* |
Disallow | *Order-History* |
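To illustrate how the wildcard paths above might be evaluated, here is a minimal sketch that translates each pattern into a regular expression, treating `*` as "any sequence of characters" and a trailing `$` as an end-of-path anchor. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical. Crawlers differ in how they treat patterns that do not start with `/` (such as `*Wishlist-*`); this sketch simply lets the leading `*` match from the start of the path. Since the group contains only Disallow rules, no Allow precedence logic is needed.

```python
import re

# Disallow patterns copied from the "*" group in the table above.
DISALLOW = [
    "/cgi-bin/", "/fr_FR/", "/*pmin*", "/*pmax*",
    "/*prefn1*", "/*prefn2*", "/*prefn3*", "/*prefn4*",
    "/*prefv1*", "/*prefv2*", "/*prefv3*", "/*prefv4*",
    "*Wishlist-*", "*Cart-Show*", "*Order-History*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex matched from the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" in place of each "*".
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any Disallow pattern matches the URL path (with query)."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under these assumptions, a hypothetical filtered listing such as `/sg/en/search?pmin=10&pmax=50` is blocked by `/*pmin*`, while a plain category path such as `/sg/en/men/shoes` is not matched by any rule.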
msnbot
Rule | Path |
---|---|
Disallow | *Wishlist-* |
Disallow | *Cart-Show* |
Disallow | *Order-History* |
Other Records
Field | Value |
---|---|
crawl-delay | 30 |
Other Records
Field | Value |
---|---|
sitemap | https://sg.puma.com/sitemap_index.xml |
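The crawl-delay and sitemap records can be read with Python's standard `urllib.robotparser` module. The sketch below parses a small fragment reconstructed from the tables above, not the actual file, whose exact text is not shown here. It assumes the crawl-delay record belongs to the msnbot group, as the layout suggests, and it needs Python 3.8 or later for `site_maps()`. The agent name `examplebot` is hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Fragment reconstructed from the scan tables; the real file's exact
# text and ordering are assumptions.
FRAGMENT = """\
User-agent: msnbot
Disallow: *Order-History*
Crawl-delay: 30

Sitemap: https://sg.puma.com/sitemap_index.xml
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# Delay in seconds requested for msnbot; None when no group applies.
msnbot_delay = parser.crawl_delay("msnbot")
other_delay = parser.crawl_delay("examplebot")  # hypothetical agent

# Sitemap URLs declared anywhere in the file.
sitemaps = parser.site_maps()
```

Note that `urllib.robotparser` matches rule paths by prefix and does not expand `*` wildcards, so it is used here only for the non-path records.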
Warnings
- 2 invalid lines.