i.ncku.edu.tw
robots.txt

Robots Exclusion Standard data for i.ncku.edu.tw

Archived Snapshots

Resource Scan

Scan Details

Site Domain	i.ncku.edu.tw
Base Domain	ncku.edu.tw
Scan Status	Ok
Last Scan	2025-11-22T05:05:34+00:00
Next Scan	2025-12-22T05:05:34+00:00

Last Scan

Scanned	2025-11-22T05:05:34+00:00
URL	https://i.ncku.edu.tw/robots.txt
Domain IPs	140.116.249.160
Response IP	140.116.249.160
Found	Yes
Hash	528b900414605a72292125e2a0909e67aaded874e9482ce1c0c8dcf657300118
SimHash	38101d18c574

Groups

*

Rule	Path
Disallow	/includ
Disallow	/mis
Disallow	/module
Disallow	/profile
Disallow	/script
Disallow	/theme
Disallow	/CHANGELOG.txt
Disallow	/cron.php
Disallow	/INSTALL.mysql.txt
Disallow	/INSTALL.pgsql.txt
Disallow	/INSTALL.sqlite.txt
Disallow	/install.php
Disallow	/INSTALL.txt
Disallow	/LICENSE.txt
Disallow	/MAINTAINERS.txt
Disallow	/update.php
Disallow	/UPGRADE.txt
Disallow	/xmlrpc.php
Disallow	/admin
Disallow	/comment/reply
Disallow	/filter/tip
Disallow	/node/add
Disallow	/sear
Disallow	/user/regist
Disallow	/user/password
Disallow	/user/log
Disallow	/?q=admin
Disallow	/?q=comment%2Frep
Disallow	/?q=filter%2Ftip
Disallow	/?q=node%2Fad
Disallow	/?q=sear
Disallow	/?q=user%2Fpasswor
Disallow	/?q=user%2Fregist
Disallow	/?q=user%2Flog
Disallow	/?q=user%2Flogo
Disallow	/?q=flag
Disallow	/flag
Disallow	/en/flag
Disallow	/zh-hant/flag
Disallow	/zh-hant/flag/

Rule

Path

Disallow

/includ

Disallow

/mis

Disallow

/module

Disallow

/profile

Disallow

/script

Disallow

/theme

Disallow

/CHANGELOG.txt

Disallow

/cron.php

Disallow

/INSTALL.mysql.txt

Disallow

/INSTALL.pgsql.txt

Disallow

/INSTALL.sqlite.txt

Disallow

/install.php

Disallow

/INSTALL.txt

Disallow

/LICENSE.txt

Disallow

/MAINTAINERS.txt

Disallow

/update.php

Disallow

/UPGRADE.txt

Disallow

/xmlrpc.php

Disallow

/admin

Disallow

/comment/reply

Disallow

/filter/tip

Disallow

/node/add

Disallow

/sear

Disallow

/user/regist

Disallow

/user/password

Disallow

/user/log

Disallow

/?q=admin

Disallow

/?q=comment%2Frep

Disallow

/?q=filter%2Ftip

Disallow

/?q=node%2Fad

Disallow

/?q=sear

Disallow

/?q=user%2Fpasswor

Disallow

/?q=user%2Fregist

Disallow

/?q=user%2Flog

Disallow

/?q=user%2Flogo

Disallow

/?q=flag

Disallow

/flag

Disallow

/en/flag

Disallow

/zh-hant/flag

Disallow

/zh-hant/flag/

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

10

Back to top

Comments

robots.txt
This file is to prevent the crawling and indexing of certain parts
of your site by web crawlers and spiders run by sites like Yahoo!
and Google. By telling these "robots" where not to go on your site,
you save bandwidth and server resources.
This file will be ignored unless it is at the root of your host:
Used: http://example.com/robots.txt
Ignored: http://example.com/site/robots.txt
For more information about the robots.txt standard, see:
http://www.robotstxt.org/robotstxt.html
Directories
Files
Paths (clean URLs)
Paths (no clean URLs)

Back to top

i.ncku.edu.twrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

i.ncku.edu.tw
robots.txt