hisharp.com.tw
robots.txt

Robots Exclusion Standard data for hisharp.com.tw

Archived Snapshots

Resource Scan

Scan Details

Site Domain	hisharp.com.tw
Base Domain	hisharp.com.tw
Scan Status	Ok
Last Scan	2026-03-14T23:26:07+00:00
Next Scan	2026-04-13T23:26:07+00:00

Last Scan

Scanned	2026-03-14T23:26:07+00:00
URL	http://hisharp.com.tw/robots.txt
Redirect	https://www.hisharp.com/robots.txt
Redirect Domain	www.hisharp.com
Redirect Base	hisharp.com
Domain IPs	72.18.200.73
Response IP	72.18.200.73
Found	Yes
Hash	69a5a260dcc634aa88089c8cabde1cb10ed862ecb1a801f21ff600ec33d78776
SimHash	d30ec8616f62

Groups

yisouspider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

easouspider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

etaospider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

findlinks

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ezooms

Rule	Path
Disallow	/

Rule

Path

Disallow

/

claudebot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

netestate ne crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	http://www.hisharp.com/sitemap.xml

Field

Value

sitemap

http://www.hisharp.com/sitemap.xml

Back to top

Comments

http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
"TurnitinBot/2.1 (http://www.turnitin.com/robot/crawlerinfo.html)"
"netEstate NE Crawler (+http://www.sengine.info/)"

Back to top

hisharp.com.twrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

yisouspider

easouspider

etaospider

findlinks

ezooms

claudebot

mj12bot

turnitinbot

netestate ne crawler

*

Other Records

Comments

hisharp.com.tw
robots.txt