researchweb.org
robots.txt

Robots Exclusion Standard data for researchweb.org

Resource Scan

Scan Details

Site Domain researchweb.org
Base Domain researchweb.org
Scan Status Ok
Last Scan2024-11-03T13:47:13+00:00
Next Scan 2024-12-03T13:47:13+00:00

Last Scan

Scanned2024-11-03T13:47:13+00:00
URL https://researchweb.org/robots.txt
Domain IPs 193.93.251.228
Response IP 193.93.251.228
Found Yes
Hash ae1ac6078ec5b9c2d195ad1af1c0cb08db50d67245c2aca3810c569ac1dbc0ea
SimHash 5017edd24d88

Groups

*

Rule Path
Disallow /info/dir/

*

Rule Path
Disallow /*?*

obot
bbot
brands-bot-logo
clarabot
serpstatbot
seekport
datanyze
experibot
indeedbot
extlinksbot
crawler4j
dataprovider
daum
mauibot
panscient.com
vscooter
psbot
ia_archiver
mj12bot
twiceler
yandex
taptubot
googlebot-image
twengabot
sitebot
baiduspider
ahrefsbot
ezooms
sistrix
aihitbot
infopath
infopath.2
swebot
ec2linkfinder
turnitinbot
the knowledge ai
mappy
petalbot

Rule Path
Disallow /

searchmetericsbot
wbsearchbot
exabot
sosospider
ip-web-crawler.com
netestate ne crawler
aboundexbot
aboundex
meanpathbot
mail.ru
spbot
archive.org_bot
linkpadbot
easouspider
seznambot
wotbox
blexbot
xovibot
semrushbot
a6-indexer
riddler
loadtimebot
obot
mojeekbot
memorybot
ltx71

Rule Path
Disallow /

advbot
smtbot
yisouspider
lssrocketcrawler
gsa-crawler
nutch
tbot-nutch
thunderstone
yacybot
ranksonicbot
betabot
parsijoo-bot
nextgensearchbot
gocrawl
plukkie
applebot
lipperhey
safednsbot
rankactivelinkbot
sogou blog
sogou inst spider
sogou news spider
sogou orion spider
sogou spider2
sogou web spider
uptimebot
seeker
cliqzbot
domaincrawler
yoozbot
coccocbot-web
qwantify
siteexplorer
findxbot
garlikcrawler
zoominfobot
bubing
barkrowler
rogerbot
dotbot
jamesbot
contacts-crawler
ccbot
idbot
dnyzbot
piplbot
alphabot
alphaseobot
alphaseobot-sa
seokicks-robot

Rule Path
Disallow /

Warnings

  • 1 invalid line.