blesscollectionhotels.com
robots.txt

Robots Exclusion Standard data for blesscollectionhotels.com

Resource Scan

Scan Details

Site Domain blesscollectionhotels.com
Base Domain blesscollectionhotels.com
Scan Status Ok
Last Scan2024-05-29T18:34:42+00:00
Next Scan 2024-06-28T18:34:42+00:00

Last Scan

Scanned2024-05-29T18:34:42+00:00
URL https://blesscollectionhotels.com/robots.txt
Redirect https://www.blesscollectionhotels.com/robots.txt
Redirect Domain www.blesscollectionhotels.com
Redirect Base blesscollectionhotels.com
Domain IPs 13.35.18.120, 13.35.18.126, 13.35.18.2, 13.35.18.85
Redirect IPs 13.35.18.120, 13.35.18.126, 13.35.18.2, 13.35.18.85
Response IP 13.35.18.2
Found Yes
Hash 2aabc471cbbd3586367adab4d182c95830770b0fb89e791122cc135e9d96e051
SimHash b4595112cef5

Groups

*

Rule Path
Allow /

semrushbot-sa

Rule Path
Allow /
Disallow /es/cart
Disallow /es/checkout
Disallow /es/my-account
Disallow /en/cart
Disallow /en/checkout
Disallow /en/my-account
Disallow /es/ibiza-santaeulalia/bless-hotel-ibiza
Disallow /en/ibiza-santaeulalia/bless-hotel-ibiza
Disallow /es/ibiza-santaeulalia/bless-hotel-ibiza/
Disallow /en/ibiza-santaeulalia/bless-hotel-ibiza/

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

orthogaffe

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

wget

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.blesscollectionhotels.com/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of pages
  • Allow search crawlers to discover the sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Bloqueo de bots y crawlers poco utiles