whc.ca
robots.txt

Robots Exclusion Standard data for whc.ca

Resource Scan

Scan Details

Site Domain whc.ca
Base Domain whc.ca
Scan Status Ok
Last Scan 5/9/2025, 10:39:57 PM
Next Scan 5/23/2025, 10:39:57 PM

Last Scan

Scanned 5/9/2025, 10:39:57 PM
URL https://whc.ca/robots.txt
Domain IPs 104.26.6.2, 104.26.7.2, 172.67.70.182, 2606:4700:20::681a:602, 2606:4700:20::681a:702, 2606:4700:20::ac43:46b6
Response IP 104.26.7.2
Found Yes
Hash 60a0a4d58cbfec7c0e3d331c2a5293629771d79b2de091d0ae24938219e31074
SimHash 194490668fb2
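
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. Assuming it is SHA-256 over the raw response body, a change check could be sketched as follows. The function name `robots_digest` is hypothetical, and the exact bytes the scanner hashes (for example, whether line endings are normalized) are unknown.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw bytes unchanged.
    """
    return hashlib.sha256(body).hexdigest()


# Compare a freshly fetched body against the digest recorded in the scan.
recorded = "60a0a4d58cbfec7c0e3d331c2a5293629771d79b2de091d0ae24938219e31074"
body = b"User-agent: *\nDisallow: /wp-admin/\n"  # placeholder, not the real file
changed = robots_digest(body) != recorded
```

A mismatch would only indicate that the hashed bytes differ. It says nothing about which rules changed.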

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
Disallow /category/english-categories
Disallow /main/en/all-products/
Disallow /us-new-dev/
Disallow /css-modules/
Disallow /dev-homepage/
Disallow /hebergement-web-illimite-dev/
Disallow /main/fr/produits/
Disallow /main/en/all-products/
Disallow /main/fr/produits/autres/
Disallow /main/en/all-products/other/
Disallow /web-hosting-english-dev/
Disallow /sites-wordpress/
Disallow /footer__trashed/en/
Disallow /footer__trashed/fr/
Disallow /blog/fr-OLD
Disallow /blog/en-OLD
Disallow /backorder-remy
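
How a crawler might evaluate the rules listed above can be illustrated with a small matcher. This is a simplified sketch, assuming the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie). It handles plain path prefixes only, which is enough here because none of the listed rules use `*` or `$`. The name `is_allowed` is hypothetical, and only a subset of the rules is included for brevity.

```python
# A subset of the "*" group rules, as (directive, path) pairs.
RULES = [
    ("Disallow", "/wp-admin/"),
    ("Allow", "/wp-admin/admin-ajax.php"),
    ("Disallow", "/category/english-categories"),
    ("Disallow", "/main/en/all-products/"),
    ("Disallow", "/main/fr/produits/"),
    ("Disallow", "/blog/fr-OLD"),
    ("Disallow", "/blog/en-OLD"),
    ("Disallow", "/backorder-remy"),
]


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under `rules`.

    The longest matching rule path wins; Allow wins ties;
    a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for directive, prefix in rules:
        if path.startswith(prefix):
            allow = directive == "Allow"
            if len(prefix) > best_len or (len(prefix) == best_len and allow):
                best_len = len(prefix)
                allowed = allow
    return allowed


print(is_allowed("/wp-admin/admin-ajax.php"))  # Allow rule is longer than /wp-admin/
print(is_allowed("/wp-admin/options.php"))     # only the Disallow rule matches
```

Python's standard `urllib.robotparser` applies rules in file order rather than by longest match, so it can give a different answer for `/wp-admin/admin-ajax.php`, where the Disallow line precedes the Allow line.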