guerlain.com
robots.txt

Robots Exclusion Standard data for guerlain.com

Resource Scan

Scan Details

Site Domain guerlain.com
Base Domain guerlain.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-11-04T06:50:23+00:00
Next Scan 2024-12-04T06:50:23+00:00

Last Successful Scan

Scanned2024-09-12T22:07:20+00:00
URL https://guerlain.com/robots.txt
Redirect https://www.guerlain.com/robots.txt
Redirect Domain www.guerlain.com
Redirect Base guerlain.com
Domain IPs 40.115.38.83
Redirect IPs 23.215.7.16, 23.215.7.5
Response IP 125.56.219.2
Found Yes
Hash 490afbc45d23eb3fe4f7dcc3b3fc6b4c486d0cf54e829a033b838ab8b8fbf796
SimHash b400aa0a0fb5

Groups

*

Rule Path
Disallow *wishlist*
Disallow */Wishlist-Add*
Disallow */account*
Disallow */error*
Disallow */on/demandware.store/*
Disallow *prefn*
Disallow *prefv*
Disallow *srule*
Disallow *pmin*
Disallow *cgid*
Disallow *pid*
Disallow *dwcont*
Disallow *dwvar*
Disallow *format*
Disallow *?start=*
Disallow *?home*
Disallow /*?redirectAfterLogin
Disallow /*search?ID=
Disallow /*?pp=
Disallow /*?ppex=true
Disallow /*?site=
Disallow /*search?q=
Disallow /*l?gclid=
Disallow /*?customizationType=personalization&gpBottleSize
Disallow /*?customizationType=personalizationbyoption
Disallow */search