okuma.ihh.tw
robots.txt

Robots Exclusion Standard data for okuma.ihh.tw

Resource Scan

Scan Details

Site Domain okuma.ihh.tw
Base Domain ihh.tw
Scan Status Ok
Last Scan2025-11-01T17:58:36+00:00
Next Scan 2025-11-08T17:58:36+00:00

Last Scan

Scanned2025-11-01T17:58:36+00:00
URL https://okuma.ihh.tw/robots.txt
Domain IPs 104.21.78.124, 172.67.220.250, 2606:4700:3032::6815:4e7c, 2606:4700:3032::ac43:dcfa
Response IP 104.21.78.124
Found Yes
Hash 7e2422454ececb109b9cbb97582e6d0fb92d4bbf0c3607a507be18496e87ba4f
SimHash a8852c8fa454

Groups

*

Rule Path
Disallow /admin/
Disallow /user/sign_in
Disallow /cart
Disallow /account

Other Records

Field Value
sitemap https://okuma.ihh.tw/sitemap.xml
sitemap https://okuma.ihh.tw/zh-TW/sitemap.xml
sitemap https://okuma.ihh.tw/en/sitemap.xml
sitemap https://okuma.ihh.tw/ja/sitemap.xml

Comments

  • shop global
  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /