barcelona-tourist-guide.com
robots.txt

Robots Exclusion Standard data for barcelona-tourist-guide.com

Resource Scan

Scan Details

Site Domain barcelona-tourist-guide.com
Base Domain barcelona-tourist-guide.com
Scan Status Ok
Last Scan2024-04-22T02:52:38+00:00
Next Scan 2024-05-22T02:52:38+00:00

Last Scan

Scanned2024-04-22T02:52:38+00:00
URL https://barcelona-tourist-guide.com/robots.txt
Redirect https://www.barcelona-tourist-guide.com/robots.txt
Redirect Domain www.barcelona-tourist-guide.com
Redirect Base barcelona-tourist-guide.com
Domain IPs 104.26.14.45, 104.26.15.45, 172.67.68.188
Redirect IPs 104.26.14.45, 104.26.15.45, 172.67.68.188
Response IP 104.26.15.45
Found Yes
Hash cffc9e55d1087ce8ffc086e9329ca9bd66705d89e56af9f07ee4dc91505afa96
SimHash e1971fe3c9e7

Groups

mediapartners-google*
googlebot

Rule Path
Disallow /*?
Disallow /captcha/
Disallow /cgi-bin/
Disallow /formmail/
Disallow /search/
Disallow /faq/search/
Disallow /faq/attachments/
Disallow /faq/admin/tmp/
Disallow /faq/images/
Disallow /faq/templates/
Disallow /Library/
Disallow /mp3files/
Disallow /quicknote/
Disallow /smplayers/
Disallow /dwTemplates/
Disallow /ssi/
Disallow /sites/
Allow /0-Config/*?
Allow /0-Scripts/*?

*

Rule Path
Disallow /captcha/
Disallow /cgi-bin/
Disallow /formmail/
Disallow /search/
Disallow /faq/search/
Disallow /faq/attachments/
Disallow /faq/admin/tmp/
Disallow /faq/images/
Disallow /faq/templates/
Disallow /Library/
Disallow /mp3files/
Disallow /quicknote/
Disallow /smplayers/
Disallow /dwTemplates/
Disallow /ssi/
Disallow /sites/

ahrefsbot
alexibot
aqua_products
asterias
b2w/0.1
backdoorbot/1.0
blexbot
blowfish/1.0
bookmark search tool
botalot
botrighthere
builtbottough
bullseye/1.0
bunnyslippers
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
dittospyder
dotbot
emailcollector
emailwolf
erocrawler
extractorpro
fairad client
flaming attackbot
foobot
gaisbot
getright/4.2
harvest/1.5
hloader
httplib
httrack 3.0
humanlinks
infonavirobot
iron33/1.0.2
jennybot
kenjin spider
keyword density/0.9
larbin
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lnspiderguy
lwp-trivial
lwp-trivial/1.34
mata hari
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mj12bot
moget
moget/2.1
mozilla/4.0 (compatible; bullseye; windows 95)
msiecrawler
netants
netmechanic
nicerspro
offline explorer
openbot
openfind
openfind data gatherer
oracle ultra search
perman
propowerbot/2.14
prowebwalker
psbot
python-urllib
queryn metasearch
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
rma
rogerbot
searchpreview
semrushbot
sitesnagger
spankbot
spanner
spbot
suzuran
szukacz/1.4
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
turnitinbot/1.5
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
web image collector
webauto
webbandit
webbandit/3.50
webcapture 2.0
webcopier
webcopier v.2.2
webcopier v3.2a
webenhancer
websauger
website quester
webster pro
webstripper
webzip
webzip/4.0
webzip/4.21
webzip/5.0
wget
wget
wget/1.5.3
wget/1.6
www-collector-e
xenu
xenu's
xenu's link sleuth 1.1c
xenu's link sleuth 1.2
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout

Rule Path
Disallow /

geturl.rexx v1.05
htmlgobble v2.2
ibm_planetwide,
monster/vx.x.x -$type ($ostype)
netcarta cyberpilot pro
packrat/1.0
shai'hulud
tarspider
templeton/
w3mir
webcopy/
webfetcher/0.8,
webvac/1.0
webwalk
wget/1.4.0
xget/0.7

Rule Path
Disallow /

gptbot
ccbot

Rule Path
Disallow /

Comments

  • specifically allowed robots:
  • pages that should not be indexed:
  • Bad robots list exluding mirrors which are listed separatly below
  • End Bad robots list
  • The following robots are specifically banned because they are mirror robots
  • End of mirror robots
  • Begin AI Crawlers
  • End AI Crawlers