manor.ch
robots.txt

Robots Exclusion Standard data for manor.ch

Resource Scan

Scan Details

Site Domain manor.ch
Base Domain manor.ch
Scan Status Ok
Last Scan 2024-09-15T21:31:46+00:00
Next Scan 2024-09-29T21:31:46+00:00
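
The two timestamps above imply a fixed rescan interval. A minimal sketch of that arithmetic, assuming the scanner simply adds a constant offset to the last scan time (the variable names are illustrative and not part of the report):

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-09-15T21:31:46+00:00")
next_scan = datetime.fromisoformat("2024-09-29T21:31:46+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 14
```

For this record the interval is exactly 14 days; whether the scanner uses the same interval for every site is not stated in the report.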

Last Scan

Scanned 2024-09-15T21:31:46+00:00
URL https://manor.ch/robots.txt
Redirect https://www.manor.ch/robots.txt
Redirect Domain www.manor.ch
Redirect Base manor.ch
Domain IPs 20.54.216.111
Redirect IPs 13.107.246.59, 2620:1ec:bdf::59
Response IP 13.107.246.59
Found Yes
Hash 20f37a412df0b3ef7d6999ca2e0e4c2ccb3b14df61cb55fefb73666567958211
SimHash 7047863debf4

Groups

*

Rule Path
Disallow /de/cart
Disallow /fr/cart
Disallow /it/cart
Disallow /de/checkout
Disallow /fr/checkout
Disallow /it/checkout
Disallow /de/my-account
Disallow /fr/my-account
Disallow /it/my-account
Disallow /en/
Disallow */search*
Disallow /*?q=*
Disallow /*%26q%3D*
Disallow /de/Shop/Schmuck-%26-Uhren/
Disallow /fr/Shop/Bijoux-%26-Montres/
Disallow /it/Shop/Gioielli-%26-Orologi/
Disallow /de/Shop/Heim-%26-Haushalt/
Disallow /fr/Shop/Maison-%26-m%C3%A9nage
Disallow /it/Shop/Casa-%26-casalinghi/
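
Several rules in this group use the `*` wildcard, which Python's standard `urllib.robotparser` treats as a literal character rather than a wildcard. The following is a minimal sketch of wildcard matching in the style of RFC 9309, where `*` matches any run of characters and a trailing `$` anchors the end of the path. The helper names `rule_to_regex` and `is_disallowed` are hypothetical, the rule list is an abbreviated copy of the group above, and the sketch ignores `Allow` rules and percent-encoding normalisation.

```python
import re

def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regular expression."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with ".*" for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))

def is_disallowed(path: str, disallow_rules: list[str]) -> bool:
    """Return True if any Disallow pattern matches the start of the path."""
    return any(rule_to_regex(rule).match(path) for rule in disallow_rules)

# Abbreviated copy of the '*' group; not the full rule set.
RULES = ["/de/cart", "/en/", "*/search*", "/*?q=*", "/*%26q%3D*"]

print(is_disallowed("/de/cart/items", RULES))     # plain prefix rule
print(is_disallowed("/de/search?text=x", RULES))  # wildcard before /search
print(is_disallowed("/de/Shop/?q=shoes", RULES))  # query-parameter rule
print(is_disallowed("/fr/Shop/", RULES))          # no rule applies
```

Under these assumptions the first three paths are blocked and the last one is not. Real crawlers may differ in how they handle wildcards and encoded characters.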

cazoodlebot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot/1.0

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

pinterestbot

Rule Path
Disallow /shop/
Disallow /produktkollektionen/
Disallow /collections/
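
The groups above are alternatives, not layers: under RFC 9309 a crawler follows the group whose user-agent token matches its own name and falls back to `*` only when no named group matches. A minimal sketch of that selection, using an abbreviated copy of the groups (the `GROUPS` dictionary and `select_rules` helper are hypothetical names for illustration):

```python
# Abbreviated copy of the groups listed above; keys are lower-cased tokens.
GROUPS = {
    "*": ["/de/cart", "/de/checkout", "/en/"],
    "ahrefsbot": ["/"],
    "mj12bot": ["/"],
    "pinterestbot": ["/shop/", "/produktkollektionen/", "/collections/"],
}

def select_rules(product_token: str) -> list[str]:
    """Pick the Disallow rules for a crawler, matching its token case-insensitively."""
    token = product_token.lower()
    # A named group replaces the '*' group; it is not merged with it.
    return GROUPS.get(token, GROUPS["*"])

print(select_rules("Pinterestbot"))  # only the three pinterestbot rules
print(select_rules("AhrefsBot"))     # ['/']
print(select_rules("SomeOtherBot"))  # falls back to the '*' rules
```

One consequence, assuming a crawler follows the RFC: pinterestbot is bound only by its own three rules, so paths such as `/de/cart` from the `*` group do not apply to it.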

Other Records

Field Value
crawl-delay 1
sitemap https://www.manor.ch/sitemap.xml
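
The sitemap record must be an absolute URL, as the file's own comments note. A minimal sketch of such a check using the standard library (the helper name `is_absolute_url` is hypothetical, and restricting the scheme to HTTP(S) is an assumption of this sketch):

```python
from urllib.parse import urlsplit

def is_absolute_url(value: str) -> bool:
    """Return True if the value has an http(s) scheme and a host part."""
    parts = urlsplit(value)
    return parts.scheme in ("http", "https") and bool(parts.netloc)

print(is_absolute_url("https://www.manor.ch/sitemap.xml"))  # True
print(is_absolute_url("/sitemap.xml"))                      # False
```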

Comments

  • For all robots
  • Block internal search result pages
  • Block specific parameter urls
  • temporary fix for MCH-11223
  • Allow search crawlers to discover the sitemap
  • Sitemap paths must be ABSOLUTE and not relative.
  • Block CazoodleBot as it does not present correct accept content headers
  • Block MJ12bot as it is just noise
  • Block dotbot as it cannot parse base urls properly
  • Block Gigabot
  • Block AhrefsBot
  • Block Pinterest
  • block category pages
  • block collection pages