greenhouse.net.tw
robots.txt
Robots Exclusion Standard data for greenhouse.net.tw
Resource Scan
Scan Details
Site Domain | greenhouse.net.tw |
Base Domain | greenhouse.net.tw |
Scan Status | Ok |
Last Scan | 2024-10-17T05:25:04+00:00 |
Next Scan | 2024-11-16T05:25:04+00:00 |
Last Scan
Scanned | 2024-10-17T05:25:04+00:00 |
URL | https://www.greenhouse.net.tw/robots.txt |
Domain IPs | 13.33.30.40, 13.33.30.64, 13.33.30.86, 13.33.30.90 |
Response IP | 13.33.30.86 |
Found | Yes |
Hash | f1f8374bd26bde47e64ba33d089d388d5299959063efa5d9052874890eeafae2 |
SimHash | 215c1f01cdd6 |
Groups
*
Rule | Path |
---|---|
Disallow | /closed |
Disallow | /preview/ |
Disallow | /users/ |
Disallow | /orders |
Disallow | /*?*debug=* |
Disallow | /*?*theme_preview=* |
Disallow | /*?*price_range_preview=* |
Disallow | /*?*draft=* |
Disallow | /api/ |
Disallow | /themes/ |
Disallow | /products*?*query=* |
Other Records
Field | Value |
---|---|
sitemap | https://www.greenhouse.net.tw/sitemap.xml |
Comments