netkigyo.weblogs.jp
robots.txt

Robots Exclusion Standard data for netkigyo.weblogs.jp

Resource Scan

Scan Details

Site Domain netkigyo.weblogs.jp
Base Domain weblogs.jp
Scan Status Ok
Last Scan2024-11-03T14:31:05+00:00
Next Scan 2024-12-03T14:31:05+00:00

Last Scan

Scanned2024-11-03T14:31:05+00:00
URL https://netkigyo.weblogs.jp/robots.txt
Domain IPs 104.18.114.121, 104.18.115.121, 104.18.116.121, 104.18.117.121, 104.18.118.121
Response IP 104.18.116.121
Found Yes
Hash 148f2b4bfdb3bf76f2c6afe20930463c750af95d5e1bc6630c76f69989946427
SimHash 6028c270ebb9

Groups

*

Rule Path
Disallow /t/trackback
Disallow /t/comments
Disallow /t/stats
Disallow /t/app
Disallow /.m/

*

Rule Path
Disallow /*.html?cid=*
Disallow /*/comments/page/*
Disallow /*/comments/atom.xml
Disallow /*/comments/rss.xml
Disallow /*/comments/index.rdf

googlebot-mobile

Rule Path
Allow /.m/
Disallow /

y!j-srd

Rule Path
Allow /.m/
Disallow /

y!j-mbs

Rule Path
Allow /.m/
Disallow /

active cache request

Rule Path
Disallow *

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1.0

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

gsa-crawler

Rule Path
Disallow /

twitterbot

Rule Path
Disallow

Comments

  • block against duplicate content
  • block MSIE from abusing cache request