webshop.telegraaf.nl
robots.txt

Robots Exclusion Standard data for webshop.telegraaf.nl

Resource Scan

Scan Details

Site Domain webshop.telegraaf.nl
Base Domain telegraaf.nl
Scan Status Ok
Last Scan2024-05-07T04:23:42+00:00
Next Scan 2024-06-06T04:23:42+00:00

Last Scan

Scanned2024-05-07T04:23:42+00:00
URL https://webshop.telegraaf.nl/robots.txt
Domain IPs 104.18.34.171, 172.64.153.85, 2606:4700:4400::6812:22ab, 2606:4700:4400::ac40:9955
Response IP 172.64.153.85
Found Yes
Hash cbd275ab260536823f6a23a72bfe09a717bf18e05e36659fbd9c2f8456ee0204
SimHash 6d109da0abd0

Groups

*

Rule Path
Disallow /catalog/
Disallow /catalogsearch/
Disallow /checkout/
Disallow /customer/
Disallow /wishlist/
Disallow */stores/store/*
Disallow *?colour=*
Disallow *?mc_eid*
Disallow *?_escaped_fragment_=*
Disallow *price%3D*
Disallow *product_list_limit%3D*
Disallow *?option=*
Disallow *?___from_store=*

Other Records

Field Value
sitemap https://webshop.telegraaf.nl/sitemaps/telegraaf_aanbiedingen_nl_nl/sitemap.xml

Comments

  • Crawlers Setup
  • Paths (clean URLs)
  • Parameter URLs
  • Website Sitemap