wuhousetw.com
robots.txt
Robots Exclusion Standard data for wuhousetw.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | wuhousetw.com |
Base Domain | wuhousetw.com |
Scan Status | Ok |
Last Scan | 2024-09-18T11:05:31+00:00 |
Next Scan | 2024-10-18T11:05:31+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-09-18T11:05:31+00:00 |
URL | https://www.wuhousetw.com/robots.txt |
Domain IPs | 13.35.18.2, 13.35.18.41, 13.35.18.48, 13.35.18.7 |
Response IP | 13.35.18.48 |
Found | Yes |
Hash | a1248b586097d2c1c587aa9d60102321065e0d1238eb15b3dacefcc0cc1371d7 |
SimHash | f36500636316 |
Groups
*
Rule | Path |
---|---|
Allow | /frontend/css/*.css |
Allow | /commons/css/*.css |
Disallow | /loadpage |
Disallow | /order/* |
Disallow | /payment/* |
Disallow | /login/* |
Disallow | /member/* |
Disallow | /cart/fail/* |
Disallow | /currencylang |
Disallow | /search/* |
Disallow | /wp-content/* |
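To illustrate how a crawler might evaluate the rules in the `*` group, here is a minimal Python sketch. It assumes the matching behaviour described in RFC 9309 (`*` as a wildcard, an optional trailing `$` as an end anchor, longest matching pattern wins, `Allow` wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical and not part of the scan data, and the sketch skips percent-encoding normalisation.

```python
import re

# Rules transcribed from the "*" group table above, in the order listed.
RULES = [
    ("allow", "/frontend/css/*.css"),
    ("allow", "/commons/css/*.css"),
    ("disallow", "/loadpage"),
    ("disallow", "/order/*"),
    ("disallow", "/payment/*"),
    ("disallow", "/login/*"),
    ("disallow", "/member/*"),
    ("disallow", "/cart/fail/*"),
    ("disallow", "/currencylang"),
    ("disallow", "/search/*"),
    ("disallow", "/wp-content/*"),
]


def pattern_to_regex(path):
    """Turn a robots.txt path pattern into a regex anchored at the start."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # '*' matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be crawled under the given rules."""
    best = None  # (pattern length, is_allow); longer wins, Allow wins ties
    for kind, path in rules:
        if pattern_to_regex(path).match(url_path):
            candidate = (len(path), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]


for p in ["/frontend/css/site.css", "/order/123", "/order", "/products/1"]:
    print(p, is_allowed(p))
```

Under these assumptions, `/order/123` is blocked while `/order` itself is not, because the pattern `/order/*` requires the trailing slash.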
Other Records
Field | Value |
---|---|
sitemap | https://www.wuhousetw.com/sitemap.xml |
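The tables above can be read back into an approximate robots.txt file. The sketch below does that and then parses the text into groups and other records, mirroring the layout of this report. The `ROBOTS_TXT` string is a reconstruction, not the file's actual bytes (ordering, whitespace, and comments are assumptions, so it would not reproduce the recorded hash), and `parse_robots` is a hypothetical helper.

```python
# Reconstructed from the Groups and Other Records tables; an approximation only.
ROBOTS_TXT = """\
User-agent: *
Allow: /frontend/css/*.css
Allow: /commons/css/*.css
Disallow: /loadpage
Disallow: /order/*
Disallow: /payment/*
Disallow: /login/*
Disallow: /member/*
Disallow: /cart/fail/*
Disallow: /currencylang
Disallow: /search/*
Disallow: /wp-content/*
Sitemap: https://www.wuhousetw.com/sitemap.xml
"""


def parse_robots(text):
    """Split robots.txt text into rule groups and other (field, value) records."""
    groups = {}            # user-agent -> list of (rule, path)
    other = []             # records that are not user-agent/allow/disallow
    agents = []            # user-agents of the group currently being read
    reading_rules = False  # True once the current group has at least one rule
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and padding
        if ":" not in line:
            continue
        field, value = (s.strip() for s in line.split(":", 1))
        field = field.lower()
        if field == "user-agent":
            if reading_rules:  # a user-agent after rules starts a new group
                agents, reading_rules = [], False
            agents.append(value)
            groups.setdefault(value, [])
        elif field in ("allow", "disallow"):
            reading_rules = True
            for agent in agents:
                groups[agent].append((field, value))
        else:
            other.append((field, value))
    return groups, other


groups, other = parse_robots(ROBOTS_TXT)
print(list(groups), len(groups["*"]))
print(other)
```

The sitemap line lands in `other` rather than in a group because, per RFC 9309, such records are not tied to any user-agent.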