hifi.lu
robots.txt

Robots Exclusion Standard data for hifi.lu

Resource Scan

Scan Details

Site Domain hifi.lu
Base Domain hifi.lu
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-04-01T06:44:21+00:00
Next Scan 2024-06-30T06:44:21+00:00

Last Successful Scan

Scanned2023-02-13T20:21:45+00:00
URL https://www.hifi.lu/robots.txt
Domain IPs 23.215.7.19, 23.215.7.26, 2600:1413:1::76d6:a761, 2600:1413:1::76d6:a777
Response IP 96.17.180.3
Found Yes
Hash 68faee40ddb1210aeb28e7d3362a0b96cb4243df4bca51c11f10d20a6171e52a
SimHash 00d7d196cdf9

Groups

*

Rule Path
Disallow *q%3D
Disallow *pageSize%3D
Disallow */budget-card
Disallow */cart
Disallow */checkout
Disallow */instore
Disallow */login
Disallow */myaccount
Disallow */santander
Disallow */sharedlist
Disallow */contentful-preview/*
Disallow /de/lieferanten
Disallow /fr/fournisseurs
Disallow /de/listen
Disallow /fr/listes
Disallow /*/c/*q%3D
Disallow /*/c/*pageSize%3D
Disallow /*/c/*reviewPage%3D
Disallow /nl/de/*

cazoodlebot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

teleport

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

psbot

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

larbin

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

moget

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.hifi.lu/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of pages
  • Block bot crawling en language.
  • Sitemap
  • Block CazoodleBot as it does not present correct accept content headers
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Yandex bot - A rule breaker, just as Baidu spiders
  • Worst bots according to https://www.benfrederickson.com/robots-txt-analysis/