isuperman.tw
robots.txt

Robots Exclusion Standard data for isuperman.tw

Resource Scan

Scan Details

Site Domain isuperman.tw
Base Domain isuperman.tw
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-18T17:36:14+00:00
Next Scan 2025-12-17T17:36:14+00:00

Last Successful Scan

Scanned2025-05-22T03:30:51+00:00
URL https://isuperman.tw/robots.txt
Redirect https://www.isuperman.tw/robots.txt
Redirect Domain www.isuperman.tw
Redirect Base isuperman.tw
Domain IPs 104.26.6.14, 104.26.7.14, 172.67.68.152, 2606:4700:20::681a:60e, 2606:4700:20::681a:70e, 2606:4700:20::ac43:4498
Redirect IPs 104.26.6.14, 104.26.7.14, 172.67.68.152, 2606:4700:20::681a:60e, 2606:4700:20::681a:70e, 2606:4700:20::ac43:4498
Response IP 104.26.7.14
Found Yes
Hash 03c19e5796819c727c9f6b03df790dc48205bfca0fea50c99c2c7b56aed1e524
SimHash a725d650e610

Groups

*

Rule Path
Disallow /cdn-cgi/
Disallow /*add-to-cart%3D*

*

Rule Path
Disallow /?s=
Disallow /search

Other Records

Field Value
crawl-delay 3

Other Records

Field Value
sitemap https://isuperman.tw/sitemap.xml

Comments

  • Prevent Crawling Unnecessary Endpoints - Dynamically added by BigScoots