hugo-sachs.de
robots.txt

Robots Exclusion Standard data for hugo-sachs.de

Resource Scan

Scan Details

Site Domain hugo-sachs.de
Base Domain hugo-sachs.de
Scan Status Ok
Last Scan2025-10-07T21:24:36+00:00
Next Scan 2025-10-21T21:24:36+00:00

Last Scan

Scanned2025-10-07T21:24:36+00:00
URL https://www.hugo-sachs.de/robots.txt
Domain IPs 3.174.46.107, 3.174.46.124, 3.174.46.71, 3.174.46.76
Response IP 3.169.71.103
Found Yes
Hash a961c5f37bb731bf5a0edd0cb6e6ba4c28758882e6da7a4816683f6ab6e8d414
SimHash 6124f942c182

Groups

*

Rule Path
Disallow /*?
Disallow /index.php/
Disallow /wishlist/
Disallow /admin/
Disallow /catalogsearch/
Disallow /onestepcheckout/
Disallow /review/product/
Disallow /sendfriend/
Disallow /enable-cookies/
Disallow /LICENSE.txt
Disallow /LICENSE.html
Disallow /skin/
Disallow /js/
Disallow /directory/
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /pub/
Disallow /tag/
Disallow /review/

Comments

  • Robots.txt
  • Prevent crawl of user and login pages
  • Block native product URL (Only crawl by url key)
  • Block crawl of filters in pages
  • Block CMS directories
  • Block duplicate content