helvex.com
robots.txt

Robots Exclusion Standard data for helvex.com

Resource Scan

Scan Details

Site Domain helvex.com
Base Domain helvex.com
Scan Status Ok
Last Scan2024-09-23T00:55:53+00:00
Next Scan 2024-10-23T00:55:53+00:00

Last Scan

Scanned2024-09-23T00:55:53+00:00
URL https://helvex.com/robots.txt
Domain IPs 13.33.30.115, 13.33.30.21, 13.33.30.30, 13.33.30.83
Response IP 13.33.30.115
Found Yes
Hash 8bc4e9c0ad48d4932d6c933b5b17559fa4a6fe0475468a84326fd18f20d7baf2
SimHash 2b09712fc5d3

Groups

*

Rule Path
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalogsearch/
Disallow /control/
Disallow /contacts/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /sendfriend/
Disallow /wishlist/
Disallow /review/product/listAjax/id/
Disallow /*?*product_list_mode=
Disallow /*?*product_list_order=
Disallow /*?*product_list_limit=
Disallow /*?*product_list_dir=
Disallow /*.php$
Disallow /*?SID=

Comments

  • TODO: Update this with appropriate url
  • Sitemap: https://www.example.com/sitemap.xml
  • Paths (clean URLs)
  • Need to allow these urls paths to be indexed, since M2 commonly includes these links in sitemap.xml
  • Disallow: /catalog/category/view/
  • Disallow: /catalog/product/view/
  • Do not index pages that are sorted or filtered.
  • Do not index session ID