in.puma.com
robots.txt
Robots Exclusion Standard data for in.puma.com
Resource Scan
Scan Details
Site Domain | in.puma.com |
Base Domain | puma.com |
Scan Status | Ok |
Last Scan | 2024-11-08T19:20:26+00:00 |
Next Scan | 2024-11-22T19:20:26+00:00 |
Last Scan
Scanned | 2024-11-08T19:20:26+00:00 |
URL | https://in.puma.com/robots.txt |
Domain IPs | 151.101.130.132, 151.101.194.132, 151.101.2.132, 151.101.66.132 |
Response IP | 151.101.2.132 |
Found | Yes |
Hash | d758704a90338ddf7bb8fd14e5b2069fc15f62084b8f7dc993977a456c97f5a0 |
SimHash | 530c5864e512 |
Groups
*
Rule | Path |
---|---|
Disallow | *srule |
Disallow | *cgid |
Disallow | *demandware |
Disallow | *search |
Disallow | *from%3D |
Disallow | *q%3D |
Disallow | *start%3D |
Disallow | *search%3D |
Disallow | *offset%3D |
Disallow | *color%3D |
Disallow | *style%3D |
Disallow | *sport%3D |
Disallow | *team%3D |
Disallow | *pmax%3D |
Disallow | *wishlist%3D |
Disallow | *sort%3D |
Disallow | *pmin%3D |
Disallow | /api/getUserLocation |
Disallow | /*/wishlist/ |
Allow | *.js |
Allow | *.svg |
Allow | *graphql |
Other Records
Field | Value |
---|---|
sitemap | https://in.puma.com/assets/sitemaps/in/sitemap_index.xml |