wuhousetw.com
robots.txt

Robots Exclusion Standard data for wuhousetw.com

Resource Scan

Scan Details

Site Domain wuhousetw.com
Base Domain wuhousetw.com
Scan Status Ok
Last Scan2024-09-18T11:05:31+00:00
Next Scan 2024-10-18T11:05:31+00:00

Last Scan

Scanned2024-09-18T11:05:31+00:00
URL https://www.wuhousetw.com/robots.txt
Domain IPs 13.35.18.2, 13.35.18.41, 13.35.18.48, 13.35.18.7
Response IP 13.35.18.48
Found Yes
Hash a1248b586097d2c1c587aa9d60102321065e0d1238eb15b3dacefcc0cc1371d7
SimHash f36500636316

Groups

*

Rule Path
Allow /frontend/css/*.css
Allow /commons/css/*.css
Disallow /loadpage
Disallow /order/*
Disallow /payment/*
Disallow /login/*
Disallow /member/*
Disallow /cart/fail/*
Disallow /currencylang
Disallow /search/*
Disallow /wp-content/*

Other Records

Field Value
sitemap https://www.wuhousetw.com/sitemap.xml