soumunomori.com
robots.txt

Robots Exclusion Standard data for soumunomori.com

Resource Scan

Scan Details

Site Domain soumunomori.com
Base Domain soumunomori.com
Scan Status Ok
Last Scan2024-09-20T12:58:52+00:00
Next Scan 2024-09-27T12:58:52+00:00

Last Scan

Scanned2024-09-20T12:58:52+00:00
URL https://www.soumunomori.com/robots.txt
Domain IPs 2600:1413:b000:6::17d5:2bd3, 2600:1413:b000:6::17d5:2bdf, 96.17.96.23, 96.17.96.26
Response IP 23.44.4.155
Found Yes
Hash ce9e98c15d10b651d1f8d64317f45c51599ab021e53ae33dd4ab392ddd50524f
SimHash 4030ca13b457

Groups

*

Rule Path
Allow /ads.txt
Disallow /rc/
Disallow /swf

googlebot

Rule Path
Disallow /rc/
Disallow /search/
Disallow /ranking/word/
Disallow /dictionary/sct-*
Disallow /dictionary/svc-*
Disallow /dictionary/sdo-rcm/sct-*
Disallow /dictionary/sdo-rcm/svc-*
Disallow /dictionary/sdo-new/sct-*
Disallow /dictionary/sdo-new/svc-*
Disallow /column/sct-*
Disallow /column/*/sct-*
Disallow /column/article/*/sct-*
Disallow /expert/list/sct-*
Disallow /mypage/account/edit/first/

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 300

teoma

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 300

ask jeeves

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 300

yetibot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 300