Robots Exclusion Standard data for thorntons.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thorntons.com |
Base Domain | thorntons.com |
Scan Status | Ok |
Last Scan | 2024-09-07T08:08:21+00:00 |
Next Scan | 2024-10-07T08:08:21+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-07T08:08:21+00:00 |
URL | https://thorntons.com/robots.txt |
Redirect | https://www.thorntons.com:443/robots.txt |
Redirect Domain | www.thorntons.com |
Redirect Base | thorntons.com |
Domain IPs | 54.217.72.147, 54.229.192.145, 54.76.9.40 |
Redirect IPs | 125.56.219.2, 23.32.29.89, 2600:1413:b000:1d::17d1:2e91, 2600:1413:b000:1d::17d1:2e94 |
Response IP | 125.56.219.2 |
Found | Yes |
Hash | 84fa7f9f6e2fba7e5804efaeebc4aac712a84692deba378183f8a3705d1354c0 |
SimHash | 7d104f11ed90 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /uk/en/login/ |
Disallow | /uk/en/cart/ |
Disallow | /uk/en/search/ |
Disallow | /*.pdf$ |
Disallow | /.well-known/ |
Disallow | /assets/ |
Disallow | /uk/en/checkout/ |
Disallow | /uk/en/my-account/ |
Disallow | /uk/en/checkYourOrderPage/ |
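
The rules in this group mix plain path prefixes with one wildcard pattern (`/*.pdf$`). As a minimal sketch of how a crawler might evaluate them, the Python below translates each path into a regular expression, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL path. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are illustrative and are not part of the scan data. Because the group contains no Allow rules, no longest-match precedence handling is needed here.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/uk/en/login/",
    "/uk/en/cart/",
    "/uk/en/search/",
    "/*.pdf$",
    "/.well-known/",
    "/assets/",
    "/uk/en/checkout/",
    "/uk/en/my-account/",
    "/uk/en/checkYourOrderPage/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters, and a
    trailing '$' means the match must end at the end of the path.
    Without '$', the pattern is treated as a prefix.
    """
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # Escape literal pieces and join them with '.*' where '*' appeared.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + ("$" if anchored else ""))


RULES = [pattern_to_regex(p) for p in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rule.match(path) for rule in RULES)
```

Under these assumptions, `is_disallowed("/uk/en/cart/items")` and `is_disallowed("/docs/menu.pdf")` are both true, while `is_disallowed("/docs/menu.pdf?v=2")` is false because the `$` anchor requires the path to end in `.pdf`.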
Other Records
Field | Value |
---|---|
sitemap | https://www.thorntons.com/sitemap.xml |