habook.com
robots.txt

Robots Exclusion Standard data for habook.com

Resource Scan

Scan Details

Site Domain habook.com
Base Domain habook.com
Scan Status Ok
Last Scan2026-03-23T04:40:03+00:00
Next Scan 2026-04-22T04:40:03+00:00

Last Scan

Scanned2026-03-23T04:40:03+00:00
URL https://www.habook.com/robots.txt
Domain IPs 125.227.102.217
Response IP 125.227.102.217
Found Yes
Hash cdc7ba71ebba88fd9eaf118e043121823258375ffc1d184a506afff91520a197
SimHash d30fc8692f62

Groups

yisouspider

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

etaospider

Rule Path
Disallow /

findlinks

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

*

Rule Path
Allow /

Other Records

Field Value
sitemap https://www.habook.com/zh-tw/sitemap.xml
sitemap https://www.habook.com/en/sitemap.xml

Comments

  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • "TurnitinBot/2.1 (http://www.turnitin.com/robot/crawlerinfo.html)"
  • "netEstate NE Crawler (+http://www.sengine.info/)"