hanwag.com
robots.txt

Robots Exclusion Standard data for hanwag.com

Resource Scan

Scan Details

Site Domain hanwag.com
Base Domain hanwag.com
Scan Status Ok
Last Scan2024-09-16T20:15:28+00:00
Next Scan 2024-09-30T20:15:28+00:00

Last Scan

Scanned2024-09-16T20:15:28+00:00
URL https://hanwag.com/robots.txt
Redirect https://www.hanwag.com/robots.txt
Redirect Domain www.hanwag.com
Redirect Base hanwag.com
Domain IPs 217.114.85.70
Redirect IPs 104.17.112.70, 104.17.113.70, 2606:4700::6811:7046, 2606:4700::6811:7146
Response IP 104.17.112.70
Found Yes
Hash 62566126f66a2d379c4952c45878102214ccdb95878d0ddd4040ce2c3090bd7c
SimHash 693c3159cff5

Groups

*

Rule Path
Disallow
Allow */globalassets/*
Allow /*/*/*?p=*
Disallow /*/*/*?p=*&*
Allow /*/*/*?_t_q*
Allow /*/*/*?v=*
Allow /*/*/*?recId=*
Allow */For?resources*
Allow /*/*/*?utm*
Disallow /*/*/*?*&*
Disallow /*/*/search
Disallow /EPiServer/CMS/
Disallow /Util/
Disallow /*/*/my-account/
Disallow /*/*/checkout
Disallow /*/*/checkout/checkout-interstitial
Disallow /*/*/error-pages/
Disallow /*/*/login/
Disallow /*/*/register/
Disallow /*/*/account/
Disallow /*/*/wishlist/
Disallow /*/*/passwordreset/request/
Disallow /*/*/*?promo_name*
Disallow /*/*/*?filter*
Disallow /*/*/*?q=*
Disallow /*/*/?filter%2F
Disallow /*/market/

Other Records

Field Value
sitemap https://www.hanwag.com/nl/nl-nl/Sitemap.xml
sitemap https://www.hanwag.com/eu/en-gb/Sitemap.xml
sitemap https://www.hanwag.com/de/de-de/Sitemap.xml
sitemap https://www.hanwag.com/us/en-us/Sitemap.xml

Comments

  • For product images and the like
  • For pagination but only without filters
  • For product links, color variations and recommendation function
  • Theme resource files
  • For UTM codes
  • Block for Filters, sorting etc. with more than 2 parameters
  • Block crawling of search
  • Old, to be sorted
  • Sitemaps