habook.com
robots.txt

Robots Exclusion Standard data for habook.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	habook.com
Base Domain	habook.com
Scan Status	Ok
Last Scan	2026-03-23T04:40:03+00:00
Next Scan	2026-04-22T04:40:03+00:00

Last Scan

Scanned	2026-03-23T04:40:03+00:00
URL	https://www.habook.com/robots.txt
Domain IPs	125.227.102.217
Response IP	125.227.102.217
Found	Yes
Hash	cdc7ba71ebba88fd9eaf118e043121823258375ffc1d184a506afff91520a197
SimHash	d30fc8692f62

Groups

yisouspider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

easouspider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

etaospider

Rule	Path
Disallow	/

Rule

Path

Disallow

/

findlinks

Rule	Path
Disallow	/

Rule

Path

Disallow

/

ezooms

Rule	Path
Disallow	/

Rule

Path

Disallow

/

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

turnitinbot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

netestate ne crawler

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Allow	/

Rule

Path

Allow

/

Back to top

Other Records

Field	Value
sitemap	https://www.habook.com/zh-tw/sitemap.xml
sitemap	https://www.habook.com/en/sitemap.xml

Field

Value

sitemap

https://www.habook.com/zh-tw/sitemap.xml

sitemap

https://www.habook.com/en/sitemap.xml

Back to top

Comments

http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
"TurnitinBot/2.1 (http://www.turnitin.com/robot/crawlerinfo.html)"
"netEstate NE Crawler (+http://www.sengine.info/)"

Back to top

habook.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

yisouspider

easouspider

etaospider

findlinks

ezooms

mj12bot

turnitinbot

netestate ne crawler

*

Other Records

Comments

habook.com
robots.txt