heho.com.tw
robots.txt

Robots Exclusion Standard data for heho.com.tw

Resource Scan

Scan Details

Site Domain heho.com.tw
Base Domain heho.com.tw
Scan Status Ok
Last Scan2024-10-11T19:42:44+00:00
Next Scan 2024-10-18T19:42:44+00:00

Last Scan

Scanned2024-10-11T19:42:44+00:00
URL https://heho.com.tw/robots.txt
Domain IPs 34.149.230.38
Response IP 34.149.230.38
Found Yes
Hash 125fb69a31e670a97429934eb5bfb12a227db000580128b5f467a187c808fc0f
SimHash e134d9c0f211

Groups

blexbot

Rule Path
Disallow /page/*

amazonbot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

proximic

Rule Path
Disallow /page/*
Disallow /?s=

Other Records

Field Value
crawl-delay 10

rogerbot

Rule Path
Disallow /page/*
Disallow /?s=

Other Records

Field Value
crawl-delay 10

*

Rule Path
Disallow /ios/*
Disallow /Android/*
Disallow /tuoitre/*
Disallow /idnews/*
Disallow /tin/*
Disallow /24h/*
Disallow /tuc/*
Disallow /news/*
Disallow /live/*
Disallow /download/*
Disallow /app/*
Disallow /?ios%2F*
Disallow /?Android%2F*
Disallow /?app%2F*
Disallow /?download%2F*
Disallow /tag/*
Disallow /?vnnews%2F*
Disallow /?news%2F*
Disallow /?24h%2F*
Disallow /?live%2F*
Disallow /?keyword=*
Disallow /?s=*