ma-tsu.com.tw
robots.txt

Robots Exclusion Standard data for ma-tsu.com.tw

Resource Scan

Scan Details

Site Domain ma-tsu.com.tw
Base Domain ma-tsu.com.tw
Scan Status Ok
Last Scan 2025-12-29T15:19:49+00:00
Next Scan 2026-01-05T15:19:49+00:00

Last Scan

Scanned 2025-12-29T15:19:49+00:00
URL http://ma-tsu.com.tw/robots.txt
Domain IPs 203.69.43.93
Response IP 203.69.43.93
Found Yes
Hash 9b4b354395e0d6ccadbaac0f7da25a5fb2b3fad083c539d2adeea72baef26686
SimHash 991ec200c152

Groups

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

googlebot-news

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

baiduspider

Rule Path
Disallow /

yandexbot

Rule Path
Allow /

sogou spider

Rule Path
Disallow /

duckduckbot

Rule Path
Disallow /

petalbot

Rule Path
Allow /

*

Rule Path
Disallow /
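
As a rough illustration of how the groups listed above might be evaluated, the sketch below rebuilds a robots.txt body from them and queries it with Python's standard urllib.robotparser. The GROUPS table is transcribed from this report, not fetched from the site, and the helper names build_robots_txt and is_allowed are hypothetical. Real crawlers may match user agents differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# Groups as listed in this report (user agent -> (rule, path)).
# Transcribed by hand; this is an assumption about the live file's layout.
GROUPS = {
    "googlebot": ("Allow", "/"),
    "googlebot-image": ("Allow", "/"),
    "googlebot-news": ("Allow", "/"),
    "bingbot": ("Allow", "/"),
    "slurp": ("Allow", "/"),
    "baiduspider": ("Disallow", "/"),
    "yandexbot": ("Allow", "/"),
    "sogou spider": ("Disallow", "/"),
    "duckduckbot": ("Disallow", "/"),
    "petalbot": ("Allow", "/"),
    "*": ("Disallow", "/"),
}


def build_robots_txt(groups):
    """Render the groups as robots.txt text, one blank-line-separated group each."""
    lines = []
    for agent, (rule, path) in groups.items():
        lines += [f"User-agent: {agent}", f"{rule}: {path}", ""]
    return "\n".join(lines)


def is_allowed(agent, url="/"):
    """Return True if the reconstructed rules let `agent` fetch `url`."""
    parser = RobotFileParser()
    parser.parse(build_robots_txt(GROUPS).splitlines())
    return parser.can_fetch(agent, url)


if __name__ == "__main__":
    for bot in ("googlebot", "baiduspider", "duckduckbot", "some-unknown-bot"):
        print(bot, is_allowed(bot))
```

Under these assumptions, any agent without its own group falls through to the `*` group and is refused.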

Comments

  • Allow Googlebot (Google's main bot)
  • Allow Googlebot-Image (Google Images)
  • Allow Googlebot-News (Google News)
  • Allow Bingbot (Microsoft Bing)
  • Allow Slurp (Yahoo! Taiwan)
  • Allow Baiduspider (Baidu)
  • Allow YandexBot (Yandex)
  • Allow Sogou spider
  • Allow DuckDuckBot (DuckDuckGo)
  • Allow PetalBot (Huawei)
  • Disallow all other unknown bots
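
The comments above say Baiduspider, Sogou spider and DuckDuckBot are allowed, while the parsed groups in this report list Disallow / for those three. A minimal sketch of how such a mismatch could be detected automatically follows. RULES and COMMENTS are copied from this report, and the function names commented_intent and mismatches are hypothetical.

```python
import re

# Effective rule per user agent, as listed in the Groups section of this report.
RULES = {
    "googlebot": "Allow",
    "googlebot-image": "Allow",
    "googlebot-news": "Allow",
    "bingbot": "Allow",
    "slurp": "Allow",
    "baiduspider": "Disallow",
    "yandexbot": "Allow",
    "sogou spider": "Disallow",
    "duckduckbot": "Disallow",
    "petalbot": "Allow",
}

# Comment lines as listed in the Comments section of this report.
COMMENTS = [
    "Allow Googlebot (Google's main bot)",
    "Allow Googlebot-Image (Google Images)",
    "Allow Googlebot-News (Google News)",
    "Allow Bingbot (Microsoft Bing)",
    "Allow Slurp (Yahoo! Taiwan)",
    "Allow Baiduspider (Baidu)",
    "Allow YandexBot (Yandex)",
    "Allow Sogou spider",
    "Allow DuckDuckBot (DuckDuckGo)",
    "Allow PetalBot (Huawei)",
    "Disallow all other unknown bots",
]

# "<Allow|Disallow> <agent name> [(optional description)]"
COMMENT_RE = re.compile(r"(Allow|Disallow)\s+(.+?)(?:\s*\(.*\))?$")


def commented_intent(comment):
    """Return (agent, rule) stated by a comment, or None if it does not parse."""
    m = COMMENT_RE.match(comment.strip())
    if not m:
        return None
    return m.group(2).lower(), m.group(1)


def mismatches():
    """List agents whose commented intent differs from the parsed rule."""
    out = []
    for comment in COMMENTS:
        intent = commented_intent(comment)
        if intent is None:
            continue
        agent, rule = intent
        # Skip comments that do not name an agent with its own group.
        if agent in RULES and RULES[agent] != rule:
            out.append(agent)
    return out


if __name__ == "__main__":
    print(mismatches())
```

The last comment line names no specific agent, so the sketch skips it rather than guessing which group it refers to.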