tw.ichacha.net
robots.txt

Robots Exclusion Standard data for tw.ichacha.net

Resource Scan

Scan Details

Site Domain tw.ichacha.net
Base Domain ichacha.net
Scan Status Ok
Last Scan2026-02-07T11:10:43+00:00
Next Scan 2026-02-14T11:10:43+00:00

Last Scan

Scanned2026-02-07T11:10:43+00:00
URL https://tw.ichacha.net/robots.txt
Domain IPs 104.21.40.92, 172.67.183.72, 2606:4700:3034::6815:285c, 2606:4700:3037::ac43:b748
Response IP 172.67.183.72
Found Yes
Hash 4bae084b5853baab69b8b659a91f7be978c928863505db6013d0afcfc460ec61
SimHash 76a0bcfc655e

Groups

googlebot
bingbot

Rule Path
Disallow /people.aspx?*
Disallow /epeople.aspx?*
Disallow /ppl.aspx?*
Disallow /eppl.aspx?*
Disallow /sutf8.aspx?*
Disallow /sgb.aspx?*
Disallow /se.aspx?*
Disallow /hanban.aspx?*
Disallow /en8848.aspx?*
Disallow /liuxue.aspx?*
Disallow /chntravel.aspx?*
Disallow /chntvl.aspx?*
Disallow /fanyi.aspx?*
Disallow /search.aspx?*
Disallow /amp.aspx?*
Disallow /en/
Disallow /am/
Disallow /amjp/
Disallow /amkr/
Disallow /amfr/
Disallow /amru/
Disallow /amhy/
Disallow /amzj/
Disallow /amfy/
Disallow /amjy/
Disallow /ru/
Disallow /mru/
Disallow /id/
Disallow /mid/
Disallow /ar/
Disallow /mar/
Disallow /search.aspx?*
Disallow /m.aspx?*

youdaobot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

Comments

  • User-agent: Yahoo! Slurp China
  • Crawl-delay: 2
  • User-agent: Yahoo!+Slurp+China
  • Crawl-delay: 2
  • User-agent: Slurp China
  • Crawl-delay: 2