heho.com.tw
robots.txt

Robots Exclusion Standard data for heho.com.tw

Archived Snapshots

Resource Scan

Scan Details

Site Domain	heho.com.tw
Base Domain	heho.com.tw
Scan Status	Ok
Last Scan	2024-10-11T19:42:44+00:00
Next Scan	2024-10-18T19:42:44+00:00

Last Scan

Scanned	2024-10-11T19:42:44+00:00
URL	https://heho.com.tw/robots.txt
Domain IPs	34.149.230.38
Response IP	34.149.230.38
Found	Yes
Hash	125fb69a31e670a97429934eb5bfb12a227db000580128b5f467a187c808fc0f
SimHash	e134d9c0f211

Groups

blexbot

Rule	Path
Disallow	/page/*

Rule

Path

Disallow

/page/*

amazonbot

Rule	Path
Disallow	/

Rule

Path

Disallow

gptbot

Rule	Path
Disallow	/

Rule

Path

Disallow

mj12bot

Rule	Path
Disallow	/

Rule

Path

Disallow

proximic

Rule	Path
Disallow	/page/*
Disallow	/?s=

Rule

Path

Disallow

/page/*

Disallow

/?s=

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

rogerbot

Rule	Path
Disallow	/page/*
Disallow	/?s=

Rule

Path

Disallow

/page/*

Disallow

/?s=

Other Records

Field	Value
crawl-delay	10

Field

Value

crawl-delay

*

Rule	Path
Disallow	/ios/*
Disallow	/Android/*
Disallow	/tuoitre/*
Disallow	/idnews/*
Disallow	/tin/*
Disallow	/24h/*
Disallow	/tuc/*
Disallow	/news/*
Disallow	/live/*
Disallow	/download/*
Disallow	/app/*
Disallow	/?ios%2F*
Disallow	/?Android%2F*
Disallow	/?app%2F*
Disallow	/?download%2F*
Disallow	/tag/*
Disallow	/?vnnews%2F*
Disallow	/?news%2F*
Disallow	/?24h%2F*
Disallow	/?live%2F*
Disallow	/?keyword=*
Disallow	/?s=*

Rule

Path

Disallow

/ios/*

Disallow

/Android/*

Disallow

/tuoitre/*

Disallow

/idnews/*

Disallow

/tin/*

Disallow

/24h/*

Disallow

/tuc/*

Disallow

/news/*

Disallow

/live/*

Disallow

/download/*

Disallow

/app/*

Disallow

/?ios%2F*

Disallow

/?Android%2F*

Disallow

/?app%2F*

Disallow

/?download%2F*

Disallow

/tag/*

Disallow

/?vnnews%2F*

Disallow

/?news%2F*

Disallow

/?24h%2F*

Disallow

/?live%2F*

Disallow

/?keyword=*

Disallow

/?s=*

heho.com.twrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

blexbot

amazonbot

gptbot

mj12bot

proximic

Other Records

rogerbot

Other Records

*

heho.com.tw
robots.txt