arche.com
robots.txt

Robots Exclusion Standard data for arche.com

Resource Scan

Scan Details

Site Domain arche.com
Base Domain arche.com
Scan Status Ok
Last Scan2024-06-04T16:22:23+00:00
Next Scan 2024-07-04T16:22:23+00:00

Last Scan

Scanned2024-06-04T16:22:23+00:00
URL https://arche.com/robots.txt
Domain IPs 104.26.14.178, 104.26.15.178, 172.67.74.183, 2606:4700:20::681a:eb2, 2606:4700:20::681a:fb2, 2606:4700:20::ac43:4ab7
Response IP 104.26.15.178
Found Yes
Hash aaa168114bf8cf33cdd4c52f2b7d3f5b149fd0a5165f57e781cf369c898ea249
SimHash 694c5144cd72

Groups

*

Rule Path
Disallow /*?size=*
Disallow /*?cat=*
Disallow /*customer/account/login/referer/*
Disallow /*stores/store/redirect/___store/*
Disallow /*stores/store/switch/?___from_store=*
Disallow /cdn-cgi/l/email-protection

Other Records

Field Value
sitemap https://www.arche.com/sitemaps.xml