ja.ichacha.net
robots.txt

Robots Exclusion Standard data for ja.ichacha.net

Resource Scan

Scan Details

Site Domain ja.ichacha.net
Base Domain ichacha.net
Scan Status Ok
Last Scan2024-09-19T21:27:53+00:00
Next Scan 2024-09-26T21:27:53+00:00

Last Scan

Scanned2024-09-19T21:27:53+00:00
URL https://ja.ichacha.net/robots.txt
Domain IPs 118.194.231.229
Response IP 118.194.231.229
Found Yes
Hash fb83d6ca24e21b0f882307d5d54af7ebeeb6abe71e5708701a9df24699c88282
SimHash d324bcfc679a

Groups

baiduspider

Rule Path
Disallow /en/
Disallow /people.aspx?*
Disallow /epeople.aspx?*
Disallow /ppl.aspx?*
Disallow /eppl.aspx?*
Disallow /sutf8.aspx?*
Disallow /sgb.aspx?*
Disallow /se.aspx?*
Disallow /hanban.aspx?*
Disallow /en8848.aspx?*
Disallow /liuxue.aspx?*
Disallow /chntravel.aspx?*
Disallow /chntvl.aspx?*
Disallow /search.aspx?*
Disallow /android/m.aspx?*
Disallow /gbk.aspx?*
Disallow /amp.aspx?*
Disallow /m.aspx?*
Disallow /amp.aspx?*
Disallow /en/
Disallow /am/
Disallow /amjp/
Disallow /amkr/
Disallow /amfr/
Disallow /amru/
Disallow /amhy/
Disallow /amzj/
Disallow /amfy/
Disallow /amjy/

googlebot

Rule Path
Disallow /people.aspx?*
Disallow /epeople.aspx?*
Disallow /ppl.aspx?*
Disallow /eppl.aspx?*
Disallow /sutf8.aspx?*
Disallow /sgb.aspx?*
Disallow /se.aspx?*
Disallow /hanban.aspx?*
Disallow /en8848.aspx?*
Disallow /liuxue.aspx?*
Disallow /chntravel.aspx?*
Disallow /chntvl.aspx?*
Disallow /fanyi.aspx?*
Disallow /en/
Disallow /android/m.aspx?*
Disallow /gbk.aspx?*
Disallow /amp.aspx?*
Disallow /m.aspx?*
Disallow /amp.aspx?*
Disallow /en/
Disallow /am/
Disallow /amjp/
Disallow /amkr/
Disallow /amfr/
Disallow /amru/
Disallow /amhy/
Disallow /amzj/
Disallow /amfy/
Disallow /amjy/

youdaobot

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

netseer

Rule Path
Disallow /

proximic

Rule Path
Disallow /

sogou blog

Rule Path
Disallow /

sogou news spider

Rule Path
Disallow /

sogou orion spider

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://ja.ichacha.net/sm.ja/idx.xml
sitemap https://ja.ichacha.net/sm.ej.ja/idx.xml
sitemap https://ja.ichacha.net/sm.ej.en/idx.xml
sitemap https://ja.ichacha.net/sm.ej.zaoju.eng/idx.xml
sitemap https://ja.ichacha.net/sm.cj.hy/idx.xml
sitemap https://ja.ichacha.net/sm.cj.zh/idx.xml
sitemap https://ja.ichacha.net/sm.ej.fayin/idx.xml

Comments

  • User-agent: Slurp
  • Disallow: /
  • User-agent: Slurp
  • Crawl-delay: 2
  • User-agent: Yahoo! Slurp China
  • Crawl-delay: 2
  • User-agent: Yahoo!+Slurp+China
  • Crawl-delay: 2
  • User-agent: Slurp China
  • Crawl-delay: 2
  • User-agent: bingbot
  • Disallow: /
  • User-agent: SogouSpider
  • Disallow: /
  • User-agent: sogou spider
  • Disallow: /
  • User-agent: sogouspider
  • Disallow: /
  • User-agent: Sogou web spider
  • Disallow: /
  • User-agent: Sogou inst spider
  • Disallow: /
  • User-agent: Sogou spider2
  • Disallow: /
  • User-agent: Sogou web spider
  • Disallow: /