justcandy.com
robots.txt

Robots Exclusion Standard data for justcandy.com

Resource Scan

Scan Details

Site Domain justcandy.com
Base Domain justcandy.com
Scan Status Ok
Last Scan2024-05-18T08:22:32+00:00
Next Scan 2024-06-17T08:22:32+00:00

Last Scan

Scanned2024-05-18T08:22:32+00:00
URL https://justcandy.com/robots.txt
Redirect https://www.justcandy.com/robots.txt
Redirect Domain www.justcandy.com
Redirect Base justcandy.com
Domain IPs 18.161.6.114, 18.161.6.127, 18.161.6.128, 18.161.6.46
Redirect IPs 2600:9000:23d2:3e00:8:490:1c80:93a1, 2600:9000:23d2:4a00:8:490:1c80:93a1, 2600:9000:23d2:5200:8:490:1c80:93a1, 2600:9000:23d2:6400:8:490:1c80:93a1, 2600:9000:23d2:7200:8:490:1c80:93a1, 2600:9000:23d2:c000:8:490:1c80:93a1, 2600:9000:23d2:dc00:8:490:1c80:93a1, 2600:9000:23d2:f200:8:490:1c80:93a1, 54.192.18.31, 54.192.18.56, 54.192.18.8, 54.192.18.90
Response IP 18.155.68.86
Found Yes
Hash 43df9451d31116fce6956dedbb8cd0155a9838d0c64097a4f529396f8d8267dd
SimHash 2e90ab96cfb9

Groups

adsbot-google

Rule Path
Allow /

adsbot-google-mobile

Rule Path
Allow /

googlebot-image

Rule Path
Disallow

*

Rule Path
Disallow /login
Disallow /wishlist
Disallow /checkout
Disallow /account
Disallow /img/*
Disallow /*map%3Dft
Disallow beta.justcandy.com

Other Records

Field Value
sitemap https://www.justcandy.com/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.

Warnings

  • 2 invalid lines.