getdoc.com
robots.txt

Robots Exclusion Standard data for getdoc.com

Resource Scan

Scan Details

Site Domain getdoc.com
Base Domain getdoc.com
Scan Status Ok
Last Scan2024-09-20T01:35:27+00:00
Next Scan 2024-10-20T01:35:27+00:00

Last Scan

Scanned2024-09-20T01:35:27+00:00
URL https://getdoc.com/robots.txt
Redirect https://www.getdoc.com/robots.txt
Redirect Domain www.getdoc.com
Redirect Base getdoc.com
Domain IPs 13.213.186.39, 13.229.19.112
Redirect IPs 13.213.186.39, 13.229.19.112
Response IP 13.213.186.39
Found Yes
Hash 054fdde224860d9fdcd19e2a4f3ca84fd10383fd6b9cb7ff8cde4291c14d4fe4
SimHash 599779e04410

Groups

google

Rule Path
Disallow

linguee
surdotlybot
panscient.com
vscooter
psbot
ia_archiver
mj12bot
twiceler
yandex
taptubot
twengabot
sitebot
baiduspider
ahrefsbot
ezooms
sistrix
aihitbot
infopath
infopath.2
swebot
ec2linkfinder
turnitinbot

Rule Path
Disallow /

searchmetericsbot
wbsearchbot
exabot
sosospider
ip-web-crawler.com
netestate ne crawler
aboundexbot
aboundex
meanpathbot
mail.ru
spbot
archive.org_bot
linkpadbot
easouspider
seznambot
wotbox
blexbot
xovibot
semrushbot
a6-indexer
riddler
loadtimebot
obot
mojeekbot
memorybot

Rule Path
Disallow /

advbot
smtbot
yisouspider
lssrocketcrawler
gsa-crawler
nutch
tbot-nutch
thunderstone
yacybot
ranksonicbot
betabot
parsijoo-bot
nextgensearchbot
gocrawl
plukkie
applebot
lipperhey
safednsbot
rome client
rankactivelinkbot
sogou web spider
uptimebot
seeker
cliqzbot
domaincrawler
yoozbot
coccocbot-web
qwantify
siteexplorer
findxbot
garlikcrawler
zoominfobot
bubing
barkrowler
rogerbot
dotbot
jamesbot
contacts-crawler
ccbot
idbot
dnyzbot
piplbot
alphabot
alphaseobot
alphaseobot-sa
seokicks-robot
ltx71
semrushbot
linkdexbot
megaindex.ru
megaindex.com

Rule Path
Disallow /

Warnings

  • 1 invalid line.