ichacha.net
robots.txt

Robots Exclusion Standard data for ichacha.net

Resource Scan

Scan Details

Site Domain ichacha.net
Base Domain ichacha.net
Scan Status Ok
Last Scan 2024-06-15T05:33:29+00:00
Next Scan 2024-06-22T05:33:29+00:00

Last Scan

Scanned 2024-06-15T05:33:29+00:00
URL https://ichacha.net/robots.txt
Redirect https://www.ichacha.net/robots.txt
Redirect Domain www.ichacha.net
Redirect Base ichacha.net
Domain IPs 118.194.253.57
Redirect IPs 118.194.253.57
Response IP 118.194.253.57
Found Yes
Hash d0355434e1afb459bc25f9844e3670531088681ba0ad770964a0e3dbe72152fd
SimHash d314d47c631e
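
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the following is only a sketch under the assumption that the digest is SHA-256 over the raw response body. The function name `content_hash` is hypothetical.

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return a hex digest of a robots.txt body.

    Assumption: SHA-256 over the raw bytes. The scan report does not
    confirm the algorithm, only that the digest is 64 hex characters.
    """
    return hashlib.sha256(body).hexdigest()


# Any change to the file body yields a different digest, which is how
# a scanner could detect that robots.txt changed between scans.
old = content_hash(b"User-agent: *\nDisallow: /learning/\n")
new = content_hash(b"User-agent: *\nDisallow: /learning/\nDisallow: /mlearning/\n")
print(old != new)
```

Comparing the stored digest with a freshly computed one is a cheap way to skip re-parsing an unchanged file.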

Groups

baiduspider

Rule Path
Disallow /en/
Disallow /people.aspx?*
Disallow /epeople.aspx?*
Disallow /ppl.aspx?*
Disallow /eppl.aspx?*
Disallow /sutf8.aspx?*
Disallow /sgb.aspx?*
Disallow /se.aspx?*
Disallow /hanban.aspx?*
Disallow /en8848.aspx?*
Disallow /liuxue.aspx?*
Disallow /chntravel.aspx?*
Disallow /chntvl.aspx?*
Disallow /search.aspx?*
Disallow /android/m.aspx?*

googlebot

Rule Path
Disallow /people.aspx?*
Disallow /epeople.aspx?*
Disallow /ppl.aspx?*
Disallow /eppl.aspx?*
Disallow /sutf8.aspx?*
Disallow /sgb.aspx?*
Disallow /se.aspx?*
Disallow /hanban.aspx?*
Disallow /en8848.aspx?*
Disallow /liuxue.aspx?*
Disallow /chntravel.aspx?*
Disallow /chntvl.aspx?*
Disallow /fanyi.aspx?*
Disallow /en/
Disallow /android/m.aspx?*
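
Many of the rules above end in `?*`, so they only apply to URLs that carry a query string. A minimal sketch of how such patterns could be evaluated follows, assuming the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end, everything else is a literal prefix). The helper names `rule_to_regex` and `is_disallowed` are hypothetical, only a subset of the googlebot rules is listed, and Allow rules and longest-match precedence are deliberately left out.

```python
import re


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces, then join them with '.*' for each '*'.
    pieces = [re.escape(piece) for piece in pattern.split("*")]
    return re.compile(".*".join(pieces) + ("$" if anchored else ""))


def is_disallowed(path: str, disallow_patterns: list[str]) -> bool:
    """True if any Disallow pattern matches the path from its start."""
    return any(rule_to_regex(p).match(path) for p in disallow_patterns)


# Subset of the googlebot group from this report.
GOOGLEBOT_DISALLOW = ["/people.aspx?*", "/fanyi.aspx?*", "/en/", "/android/m.aspx?*"]

print(is_disallowed("/people.aspx?q=test", GOOGLEBOT_DISALLOW))  # True
print(is_disallowed("/people.aspx", GOOGLEBOT_DISALLOW))         # False, no query string
print(is_disallowed("/en/hello.html", GOOGLEBOT_DISALLOW))       # True
```

Under these assumptions, `/people.aspx` without a query string stays crawlable, while any parameterized variant of it is blocked.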

youdaobot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

netseer

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

*

Rule Path
Disallow /learning/
Disallow /mlearning/
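
A crawler follows only the most specific group that names it and falls back to `*` when none does, so the `/learning/` and `/mlearning/` rules above do not apply to crawlers that have their own group. The sketch below illustrates that selection with a few groups copied from this report. The function `select_group` is hypothetical, and matching by case-insensitive substring is a simplification of how real crawlers compare product tokens.

```python
# A few groups from this report, keyed by lowercase user-agent token.
GROUPS = {
    "baiduspider": ["/en/", "/search.aspx?*"],
    "googlebot": ["/en/", "/fanyi.aspx?*"],
    "ahrefsbot": ["/"],
    "*": ["/learning/", "/mlearning/"],
}


def select_group(user_agent: str, groups: dict[str, list[str]]) -> str:
    """Return the name of the group a crawler should obey.

    Simplification: a named group matches if its token occurs anywhere
    in the lowercased user-agent string; the longest match wins.
    """
    ua = user_agent.lower()
    named = [name for name in groups if name != "*" and name in ua]
    if named:
        return max(named, key=len)
    return "*"


print(select_group("Mozilla/5.0 (compatible; Googlebot/2.1)", GROUPS))  # googlebot
print(select_group("Mozilla/5.0 (compatible; bingbot/2.0)", GROUPS))    # *
```

With this selection, an unlisted crawler is bound only by the two catch-all rules, while a crawler such as ahrefsbot is excluded from the whole site.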

Comments

  • User-agent: Slurp
  • Disallow: /
  • User-agent: Slurp
  • Crawl-delay: 2
  • User-agent: Yahoo! Slurp China
  • Crawl-delay: 2
  • User-agent: Yahoo!+Slurp+China
  • Crawl-delay: 2
  • User-agent: Slurp China
  • Crawl-delay: 2
  • User-agent: SogouSpider
  • Disallow: /
  • User-agent: sogou spider
  • Disallow: /
  • User-agent: sogouspider
  • Disallow: /
  • User-agent: Sogou web spider
  • Disallow: /
  • User-agent: Sogou inst spider
  • Disallow: /
  • User-agent: Sogou spider2
  • Disallow: /
  • User-agent: Sogou web spider
  • Disallow: /