isuperman.tw
robots.txt
Robots Exclusion Standard data for isuperman.tw
Resource Scan
Scan Details
Site Domain | isuperman.tw |
Base Domain | isuperman.tw |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2025-09-18T17:36:14+00:00 |
Next Scan | 2025-12-17T17:36:14+00:00 |
Last Successful Scan
Scanned | 2025-05-22T03:30:51+00:00 |
URL | https://isuperman.tw/robots.txt |
Redirect | https://www.isuperman.tw/robots.txt |
Redirect Domain | www.isuperman.tw |
Redirect Base | isuperman.tw |
Domain IPs | 104.26.6.14, 104.26.7.14, 172.67.68.152, 2606:4700:20::681a:60e, 2606:4700:20::681a:70e, 2606:4700:20::ac43:4498 |
Redirect IPs | 104.26.6.14, 104.26.7.14, 172.67.68.152, 2606:4700:20::681a:60e, 2606:4700:20::681a:70e, 2606:4700:20::ac43:4498 |
Response IP | 104.26.7.14 |
Found | Yes |
Hash | 03c19e5796819c727c9f6b03df790dc48205bfca0fea50c99c2c7b56aed1e524 |
SimHash | a725d650e610 |
Groups
*
Rule | Path |
---|---|
Disallow | /cdn-cgi/ |
Disallow | /*add-to-cart%3D* |
*
Rule | Path |
---|---|
Disallow | /?s= |
Disallow | /search |
Other Records
Field | Value |
---|---|
crawl-delay | 3 |
Other Records
Field | Value |
---|---|
sitemap | https://isuperman.tw/sitemap.xml |
Comments