archideninterior.com
robots.txt

Robots Exclusion Standard data for archideninterior.com

Resource Scan

Scan Details

Site Domain archideninterior.com
Base Domain archideninterior.com
Scan Status Ok
Last Scan 2024-09-02T19:05:10+00:00
Next Scan 2024-10-02T19:05:10+00:00

Last Scan

Scanned 2024-09-02T19:05:10+00:00
URL https://archideninterior.com/robots.txt
Domain IPs 69.164.219.249
Response IP 69.164.219.249
Found Yes
Hash c069f1619fcfb765b2164bdadae295bd21eb6eb3079272dcc3e52e8e520cdcb6
SimHash 71dc4273c9f6

Groups

*

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow *?replytocom
Disallow /wp-content/plugins/

Other Records

Field Value
crawl-delay 3600

*

Rule Path
Disallow /?blackhole

ninjabot

Rule Path
Allow /

mediapartners-google*

Rule Path
Allow /

googlebot-image

Rule Path
Allow /wp-content/uploads/

yandex

Rule Path Comment
Disallow / blocks access to the whole site

sistrix

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

jobs.de-robot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

spbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

obot

Rule Path
Disallow /

fr-crawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

cloudservermarketspider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

linkstats

Rule Path
Disallow /

jobboersebot

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

plista

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

mindupbot

Rule Path
Disallow /

sg-orbiter

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

kraken

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

openhosebot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

thumbsniper

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

implisensebot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

blexbot
wget/1.6
wget/1.5.3
fairad client
url_spider_pro
mozilla
turingos
python-urllib
teleport
stanford comp sci
grub-client
webbandit/3.50
ia_archiver/1.6
propowerbot/2.14
naver
repomonkey
mozilla/4.0 (compatible; msie 4.0; windows 2000)
spankbot
webzip
moget/2.1
gigabot
xenu's link sleuth 1.1c
spanner
netmechanic
cheesebot
zeus 32297 webster pro v2.9 win32
jetbot
mozilla/4.0 (compatible; msie 4.0; windows 98)
sootle
webcopier
lnspiderguy
mozilla/4.0 (compatible; bullseye; windows 95)
copyrightcheck
mozilla
emailwolf
keyword density/0.9
webmasterworld extractor
cherrypickerse/1.0
moget
es
webster pro
extractorpro
microsoft url control - 5.01.4511
openfind
exabot
hatena antenna
psbot
thenomad
ahrefsbot
http://www.searchengineworld.com bot
rogerbot
suzuran
www-collector-e
emailcollector
perman
webauto
getright/4.2
mozilla/4
vci webviewer vci webviewer win32
xenu's
ia_archiver
backdoorbot/1.0
jetbot/1.0
humanlinks
webenhancer
dittospyder
harvest/1.5
teleportpro
rma
becomebot
mozilla/3
scooter
true_robot
aqua_products
b2w/0.1
botalot
webmasterworldforumbot
linkscan/8.1a unix
enterprise_search/1.0
copernic
flaming attackbot
cosmos
nicerspro
openbot
microsoft url control
emailsiphon
grub
stanford compclub
teoma
kenjin spider
foobot
true_robot/1.0
lexibot
stanford
cherrypickerelite/1.0
msiecrawler
oracle ultra search
mozilla/5
linkwalker
larbin
zeus link scout
stanford spiderboys
alexibot
crescent
blekkobot
mister pix
website quester
searchpreview
http://www.webmasterworld.com bot
wget
crescent internet toolpak http ole control v.1.0
web image collector
lwp-trivial/1.34
cherrypicker
queryn metasearch
nutch
mata hari
dotbot
bookmark search tool
the intraformant
tocrawl/urldispatcher
gaisbot
webbandit
repomonkey bait & tackle/v1.01
miixpc/4.2
bullseye/1.0
builtbottough
enterprise_search
blowfish/1.0
asterias
lwp-trivial
dumbot
iron33/1.0.2
zeus
jennybot
vci
url control
hloader
mozilla/4.0 (compatible; msie 4.0; windows 95)
erocrawler
bunnyslippers
libweb/clshttp
radiation retriever 1.1
microsoft url control - 6.00.8169
offline explorer
mj12bot
stanford compsciclub
linkextractorpro
mozilla/4.0 (compatible; msie 4.0; windows xp)
szukacz/1.4
sitesnagger
prowebwalker
netants
infonavirobot
telesoft
webzip/4.0
websauger
miixpc
httplib
webstripper
webvac
urly warning
mozilla/4.0 (compatible; msie 4.0; windows nt)
openfind data gathere
looksmart
mail.ru

Rule Path
Disallow /

Other Records

Field Value
sitemap https://archideninterior.com/sitemap_index.xml
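
The first "*" group above combines a blanket Allow / with several narrower Disallow paths, so the outcome for a given URL depends on how a crawler resolves overlapping rules. The sketch below is a minimal illustration, assuming the longest-match precedence described in RFC 9309 (the most specific rule wins, and Allow wins a tie). The names STAR_GROUP, pattern_to_regex and is_allowed are hypothetical helpers written for this example, the sample paths are invented, and treating "*?replytocom" as a wildcard pattern is an assumption. Individual crawlers may resolve these rules differently.

```python
import re

# Rules of the first "*" group, copied from the scan as (directive, path).
STAR_GROUP = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "*?replytocom"),
    ("disallow", "/wp-content/plugins/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if path may be fetched under the given rules.

    Assumed precedence: the longest matching pattern wins, Allow wins
    a tie, and a path that matches no rule is allowed.
    """
    best_len, allowed = -1, True
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = directive == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len, allowed = len(pattern), is_allow
    return allowed


# Hypothetical sample paths, not taken from the scan.
for sample in ("/", "/wp-admin/options.php",
               "/blog/post/?replytocom=12", "/wp-content/uploads/a.jpg"):
    print(sample, is_allowed(STAR_GROUP, sample))
```

Under these assumptions, the site root and the uploads path are allowed, while the /wp-admin/ path and the ?replytocom URL are disallowed because the narrower Disallow patterns are longer than Allow /.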

Comments

  • disallow all files in these directories
  • Disallow: Sistrix
  • Disallow: Sistrix
  • Disallow: Sistrix
  • Disallow: SEOkicks-Robot
  • Disallow: jobs.de-Robot
  • Backlink Analysis
  • Bot der Leipziger Unister Holding GmbH
  • http://moz.com/products
  • http://www.searchmetrics.com
  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://www.domaintools.com/webmasters/surveybot.php
  • http://www.seodiver.com/bot
  • http://openlinkprofiler.org/bot
  • http://www.wotbox.com/bot/
  • http://www.opensiteexplorer.org/dotbot
  • http://moz.com/researchtools/ose/dotbot
  • http://www.meanpath.com/meanpathbot.html
  • http://www.backlinktest.com/crawler.html
  • http://www.brandwatch.com/magpie-crawler/
  • http://filterdb.iss.net/crawler/
  • http://webmeup-crawler.com
  • https://megaindex.com/crawler
  • http://www.cloudservermarket.com
  • http://www.trendiction.de/de/publisher/bot
  • http://www.exalead.com
  • http://www.career-x.de/bot.html
  • https://www.lipperhey.com/en/about/
  • https://www.lipperhey.com/en/about/
  • https://turnitin.com/robot/crawlerinfo.html
  • http://help.coccoc.com/
  • ubermetrics-technologies.com
  • datenbutler.de
  • http://searchgears.de/uber-uns/crawling-faq.html
  • http://commoncrawl.org/faq/
  • https://www.qwant.com/
  • http://linkfluence.net/
  • http://www.botje.com/plukkie.htm
  • https://www.safedns.com/searchbot
  • http://www.haosou.com/help/help_3_2.html
  • http://www.haosou.com/help/help_3_2.html
  • http://www.moz.com/dp/rogerbot
  • http://www.openhose.org/bot.html
  • http://www.screamingfrog.co.uk/seo-spider/
  • http://thumbsniper.com
  • http://www.radian6.com/crawler
  • http://cliqz.com/company/cliqzbot
  • https://www.aihitdata.com/about
  • http://www.trendiction.com/en/publisher/bot
  • http://warebay.com/bot.html

Warnings

  • 2 invalid lines.