moebel-boss.de
robots.txt

Robots Exclusion Standard data for moebel-boss.de

Resource Scan

Scan Details

Site Domain moebel-boss.de
Base Domain moebel-boss.de
Scan Status Ok
Last Scan 2024-09-18T02:40:03+00:00
Next Scan 2024-10-18T02:40:03+00:00

Last Scan

Scanned 2024-09-18T02:40:03+00:00
URL https://moebel-boss.de/robots.txt
Domain IPs 23.215.7.14, 23.215.7.5, 2600:1413:b000:1b::17d7:705, 2600:1413:b000:1b::17d7:70e
Response IP 96.17.180.42
Found Yes
Hash 5d17997d13f5ee2d4666a24d7fa562321f433bdcfecebd8c95a864a12720b0ac
SimHash aef65f5accfa
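
The Hash value above is 64 hexadecimal characters long, which matches the length of a SHA-256 digest. The report does not name the algorithm or say which bytes were hashed, so the following is only a sketch under the assumption that the digest is SHA-256 over the raw response body. The function name `body_fingerprint` is hypothetical.

```python
import hashlib


def body_fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt response body.

    Assumption: the scan's Hash field is SHA-256 over the raw bytes.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Example with a placeholder body, not the real file contents.
digest = body_fingerprint(b"User-agent: *\nDisallow: /cart\n")
print(len(digest))  # a SHA-256 hex digest is always 64 characters
```

Comparing such a digest between two scans would show whether the file changed byte for byte. The shorter SimHash value is presumably a similarity fingerprint and is not reproduced here.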

Groups

*
adsbot-google

Rule Path
Disallow /cart
Disallow /c/
Disallow /checkout
Disallow /my-account
Disallow /services/
Disallow /media/
Disallow /search
Disallow /my-market

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /
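
As a rough illustration of how the groups above would be applied by a crawler, the sketch below rebuilds a robots.txt from the listed rules and queries it with Python's standard `urllib.robotparser`. The exact layout of the original file is an assumption (the report only shows the parsed groups), and `examplebot` and the product path are hypothetical names used for the demonstration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in this report; the original
# file's ordering, comments, and whitespace are assumptions.
ROBOTS_TXT = """\
User-agent: *
User-agent: adsbot-google
Disallow: /cart
Disallow: /c/
Disallow: /checkout
Disallow: /my-account
Disallow: /services/
Disallow: /media/
Disallow: /search
Disallow: /my-market

User-agent: cazoodlebot
Disallow: /

User-agent: mj12bot
Disallow: /

User-agent: dotbot/1.0
Disallow: /

User-agent: gigabot
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Check whether `agent` may fetch `path` on moebel-boss.de."""
    return parser.can_fetch(agent, "https://moebel-boss.de" + path)


# A generic crawler falls under the "*" group.
print(allowed("examplebot", "/cart"))           # False: /cart is disallowed
print(allowed("examplebot", "/products/sofa"))  # True: no rule matches
# mj12bot has its own group that disallows everything.
print(allowed("mj12bot", "/products/sofa"))     # False
```

The sketch does not query dotbot, because parsers differ in how they treat a version suffix such as `/1.0` in a group name.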

Other Records

Field Value
sitemap https://moebel-boss.de/api/occ/v2/boss/sitemap.xml

Comments

  • For all robots
  • Block access to specific groups of pages
  • Disallow: /login/pw/request
  • Block some specific pages
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot