icsa.org.tw
robots.txt

Robots Exclusion Standard data for icsa.org.tw

Resource Scan

Scan Details

Site Domain icsa.org.tw
Base Domain icsa.org.tw
Scan Status Ok
Last Scan2026-02-07T14:23:54+00:00
Next Scan 2026-03-09T14:23:54+00:00

Last Scan

Scanned2026-02-07T14:23:54+00:00
URL http://icsa.org.tw/robots.txt
Redirect http://www.icsa.org.tw/robots.txt
Redirect Domain www.icsa.org.tw
Redirect Base icsa.org.tw
Domain IPs 199.34.228.58
Redirect IPs 199.34.228.58
Response IP 199.34.228.58
Found Yes
Hash 5502f35752d0571402180d6120065803c71b366a8c20241e9c34019f9ffbc61f
SimHash 2a540cf06fb3

Groups

nerdybot

Rule Path
Disallow /

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ajax/
Disallow /apps/
Disallow /33775304313177720171.html
Disallow /http%3A//www.icsa.org.tw/class2014.html
Disallow /http%3A//www.icsa.org.tw/3377530431263762036326371.html
Disallow /2019719979327692410726410201842418036027.html
Disallow /24180242302930521002332873387920316.html
Disallow /20363263713200037636.html
Disallow /33775304312010721209.html
Disallow /265192038128500-3561124107.html
Disallow /340812983520094-3561124107.html
Disallow /215553826326032-3561124107.html
Disallow /329933143538525-3561124107.html
Disallow /243733105624247-3561124107.html
Disallow /406542964020278-3561124107.html
Disallow /264462816521644-3561124107.html
Disallow /264462350022478-3561124107.html
Disallow /264463606029645-3561124107.html
Disallow /241202539122283-3561124107.html
Disallow /335392004334425-3561124107.html

Other Records

Field Value
sitemap http://www.icsa.org.tw/sitemap.xml