thehanli.com
robots.txt
Robots Exclusion Standard data for thehanli.com
Resource Scan
Scan Details
Site Domain | thehanli.com |
Base Domain | thehanli.com |
Scan Status | Ok |
Last Scan | 2024-11-16T05:53:03+00:00 |
Next Scan | 2024-12-16T05:53:03+00:00 |
Last Scan
Scanned | 2024-11-16T05:53:03+00:00 |
URL | https://www.thehanli.com/robots.txt |
Domain IPs | 108.157.254.111, 108.157.254.120, 108.157.254.27, 108.157.254.78 |
Response IP | 108.157.254.111 |
Found | Yes |
Hash | 9904090eb1190a5d123b892966b6a099133a19ef43aaae7030de2737220ce0bd |
SimHash | 215c1f01cdd6 |
Groups
*
Rule | Path |
---|---|
Disallow | /closed |
Disallow | /preview/ |
Disallow | /users/ |
Disallow | /orders |
Disallow | /*?*debug=* |
Disallow | /*?*theme_preview=* |
Disallow | /*?*price_range_preview=* |
Disallow | /*?*draft=* |
Disallow | /api/ |
Disallow | /themes/ |
Disallow | /products*?*query=* |
Other Records
Field | Value |
---|---|
sitemap | https://www.thehanli.com/sitemap.xml |
Comments