kappahl.com
robots.txt
Robots Exclusion Standard data for kappahl.com
Resource Scan
Scan Details
Site Domain | kappahl.com |
Base Domain | kappahl.com |
Scan Status | Ok |
Last Scan | 2025-10-06T20:11:18+00:00 |
Next Scan | 2025-10-20T20:11:18+00:00 |
Last Scan
Scanned | 2025-10-06T20:11:18+00:00 |
URL | https://kappahl.com/robots.txt |
Redirect | https://www.kappahl.com/robots.txt |
Redirect Domain | www.kappahl.com |
Redirect Base | kappahl.com |
Domain IPs | 217.114.94.2, 2a00:1c50:94::2 |
Redirect IPs | 104.16.188.227, 104.16.189.227, 2606:4700::6810:bce3, 2606:4700::6810:bde3 |
Response IP | 104.16.188.227 |
Found | Yes |
Hash | b25561f4b2e14fa824fb2db72dc4c72321dab1680a74b36fd798495960547c57 |
SimHash | 20554c508db3 |
Groups
*
Rule | Path |
---|---|
Disallow | */search |
Disallow | */etsi |
Disallow | */szukaj |
Disallow | */virtual-category-root |
Disallow | /contentassets* |
Disallow | /globalassests* |
Other Records
Field | Value |
---|---|
sitemap | https://www.kappahl.com/sitemap.xml |