guerlain.com
robots.txt
Robots Exclusion Standard data for guerlain.com
Resource Scan
Scan Details
Site Domain | guerlain.com |
Base Domain | guerlain.com |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-11-04T06:50:23+00:00 |
Next Scan | 2024-12-04T06:50:23+00:00 |
Last Successful Scan
Scanned | 2024-09-12T22:07:20+00:00 |
URL | https://guerlain.com/robots.txt |
Redirect | https://www.guerlain.com/robots.txt |
Redirect Domain | www.guerlain.com |
Redirect Base | guerlain.com |
Domain IPs | 40.115.38.83 |
Redirect IPs | 23.215.7.16, 23.215.7.5 |
Response IP | 125.56.219.2 |
Found | Yes |
Hash | 490afbc45d23eb3fe4f7dcc3b3fc6b4c486d0cf54e829a033b838ab8b8fbf796 |
SimHash | b400aa0a0fb5 |
Groups
*
Rule | Path |
---|---|
Disallow | *wishlist* |
Disallow | */Wishlist-Add* |
Disallow | */account* |
Disallow | */error* |
Disallow | */on/demandware.store/* |
Disallow | *prefn* |
Disallow | *prefv* |
Disallow | *srule* |
Disallow | *pmin* |
Disallow | *cgid* |
Disallow | *pid* |
Disallow | *dwcont* |
Disallow | *dwvar* |
Disallow | *format* |
Disallow | *?start=* |
Disallow | *?home* |
Disallow | /*?redirectAfterLogin |
Disallow | /*search?ID= |
Disallow | /*?pp= |
Disallow | /*?ppex=true |
Disallow | /*?site= |
Disallow | /*search?q= |
Disallow | /*l?gclid= |
Disallow | /*?customizationType=personalization&gpBottleSize |
Disallow | /*?customizationType=personalizationbyoption |
Disallow | */search |