heho.com.tw
robots.txt

Robots Exclusion Standard data for heho.com.tw

Resource Scan

Scan Details

Site Domain heho.com.tw
Base Domain heho.com.tw
Scan Status Ok
Last Scan2026-03-05T17:37:33+00:00
Next Scan 2026-03-12T17:37:33+00:00

Last Scan

Scanned2026-03-05T17:37:33+00:00
URL https://heho.com.tw/robots.txt
Domain IPs 34.149.230.38
Response IP 34.149.230.38
Found Yes
Hash 8dfebf627103e51e8950f9e4513f3b9323176c47a8ccc9bdfb871cdd3370c762
SimHash e334c9c0a200

Groups

blexbot

Rule Path
Disallow /page/*

mj12bot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

criteobot

Rule Path
Disallow /

proximic

Rule Path
Disallow /page/*
Disallow /?s=

Other Records

Field Value
crawl-delay 10

rogerbot

Rule Path
Disallow /page/*
Disallow /?s=

Other Records

Field Value
crawl-delay 10

seekport

Rule Path
Disallow /?keyword=
Disallow /?s=

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ios/*
Disallow /Android/*
Disallow /tuoitre/*
Disallow /idnews/*
Disallow /tin/*
Disallow /24h/*
Disallow /tuc/*
Disallow /news/*
Disallow /live/*
Disallow /download/*
Disallow /app/*
Disallow /?ios%2F*
Disallow /?Android%2F*
Disallow /?app%2F*
Disallow /?download%2F*
Disallow /tag/*
Disallow /?vnnews%2F*
Disallow /?news%2F*
Disallow /?24h%2F*
Disallow /?live%2F*
Disallow /?keyword=*
Disallow /?s=*

bingbot

Rule Path
Disallow /search/

criteobot

Rule Path
Disallow /