thorntons.com
robots.txt

Robots Exclusion Standard data for thorntons.com

Resource Scan

Scan Details

Site Domain thorntons.com
Base Domain thorntons.com
Scan Status Ok
Last Scan2024-09-07T08:08:21+00:00
Next Scan 2024-10-07T08:08:21+00:00

Last Scan

Scanned2024-09-07T08:08:21+00:00
URL https://thorntons.com/robots.txt
Redirect https://www.thorntons.com:443/robots.txt
Redirect Domain www.thorntons.com
Redirect Base thorntons.com
Domain IPs 54.217.72.147, 54.229.192.145, 54.76.9.40
Redirect IPs 125.56.219.2, 23.32.29.89, 2600:1413:b000:1d::17d1:2e91, 2600:1413:b000:1d::17d1:2e94
Response IP 125.56.219.2
Found Yes
Hash 84fa7f9f6e2fba7e5804efaeebc4aac712a84692deba378183f8a3705d1354c0
SimHash 7d104f11ed90

Groups

*

Rule Path
Disallow /uk/en/login/
Disallow /uk/en/cart/
Disallow /uk/en/search/
Disallow /*.pdf$
Disallow /.well-known/
Disallow /assets/
Disallow /uk/en/checkout/
Disallow /uk/en/my-account/
Disallow /uk/en/checkYourOrderPage/

Other Records

Field Value
sitemap https://www.thorntons.com/sitemap.xml