beommusa.com
robots.txt

Robots Exclusion Standard data for beommusa.com

Resource Scan

Scan Details

Site Domain beommusa.com
Base Domain beommusa.com
Scan Status Ok
Last Scan2024-11-14T16:30:09+00:00
Next Scan 2024-12-14T16:30:09+00:00

Last Scan

Scanned2024-11-14T16:30:09+00:00
URL https://beommusa.com/robots.txt
Domain IPs 52.78.116.139
Response IP 52.78.116.139
Found Yes
Hash c4a8b11b23878090a6e94a0910db736309650e52703beea5953e5037b7fbc8d3
SimHash 9b745062d5e0

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

barkrowler/0.9

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

ccbot/2.0

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

lexxebot

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

mozilla/5.0 (compatible; spbot/2.0; http://www.seoprofiler.com/bot/ )

Rule Path
Disallow /

sosospider

Rule Path
Disallow /

sosospider+(+http://help.soso.com/webspider.htm)

Rule Path
Disallow /

sitebot/0.1

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

crystalsemanticsbot

Rule Path
Disallow /

netseer crawler

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

lexxebot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

discobot

Rule Path
Disallow /

jyxobot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm

Product Comment
sogou web spider/4.0(+http://www.sogou.com/docs/help/webmasters.htm 07)
Rule Path
Disallow /

sistrix

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

garlikcrawler/1.1 (http://garlik.com/, crawler@garlik.com)

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 5.0; windows nt; digext; dts agent

Rule Path
Disallow /

psbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

addthis.com robot tech.support@clearspring.com

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

proximic

Rule Path
Disallow /

discoverybot

Rule Path
Disallow /

bl.uk_lddc_bot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

bender

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

yasni

Rule Path
Disallow /

jikespider

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

pixray-seeker

Rule Path
Disallow /

linguee

Rule Path
Disallow /

integromedb

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

grapeshotcrawler

Rule Path
Disallow /

wesee:search

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

admantx

Rule Path
Disallow /

spbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

mozilla/5.0 (compatible; yandexbot/3.0; +http://yandex.com/bots)

Rule Path
Disallow /

yandex

Rule Path
Disallow /

mozilla/5.0 (compatible;picmole/1.0 +http://www.picmole.com)

Rule Path
Disallow /

ravencrawler

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

go-http-client

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

icc-crawler/2.0 (mozilla-compatible; ; http://ucri.nict.go.jp/en/icccrawler.html)

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

yeti

Rule Path
Allow /