hisharp.com.tw
robots.txt

Robots Exclusion Standard data for hisharp.com.tw

Resource Scan

Scan Details

Site Domain hisharp.com.tw
Base Domain hisharp.com.tw
Scan Status Ok
Last Scan2026-03-14T23:26:07+00:00
Next Scan 2026-04-13T23:26:07+00:00

Last Scan

Scanned2026-03-14T23:26:07+00:00
URL http://hisharp.com.tw/robots.txt
Redirect https://www.hisharp.com/robots.txt
Redirect Domain www.hisharp.com
Redirect Base hisharp.com
Domain IPs 72.18.200.73
Response IP 72.18.200.73
Found Yes
Hash 69a5a260dcc634aa88089c8cabde1cb10ed862ecb1a801f21ff600ec33d78776
SimHash d30ec8616f62

Groups

yisouspider

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

findlinks

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap http://www.hisharp.com/sitemap.xml

Comments

  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • "TurnitinBot/2.1 (http://www.turnitin.com/robot/crawlerinfo.html)"
  • "netEstate NE Crawler (+http://www.sengine.info/)"