ioh.tw
robots.txt

Robots Exclusion Standard data for ioh.tw

Resource Scan

Scan Details

Site Domain ioh.tw
Base Domain ioh.tw
Scan Status Ok
Last Scan2025-12-20T11:43:15+00:00
Next Scan 2026-01-19T11:43:15+00:00

Last Scan

Scanned2025-12-20T11:43:15+00:00
URL https://ioh.tw/robots.txt
Domain IPs 104.26.14.135, 104.26.15.135, 172.67.72.108, 2606:4700:20::681a:e87, 2606:4700:20::681a:f87, 2606:4700:20::ac43:486c
Response IP 104.26.14.135
Found Yes
Hash 53c2d2656d6b091803f278818ac5249920229fb47fe314dadf3917aafea77b88
SimHash 32850c857d70

Groups

*

Rule Path
Disallow /console$
Disallow /console/*
Disallow /api/*
Disallow /cron/*
Disallow /workshops/*

Other Records

Field Value
sitemap https://uploads.ioh.tw/sitemaps/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-agent: *
  • Disallow: /