holity.com
robots.txt

Robots Exclusion Standard data for holity.com

Resource Scan

Scan Details

Site Domain holity.com
Base Domain holity.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-07-09T22:47:19+00:00
Next Scan 2025-10-07T22:47:19+00:00

Last Successful Scan

Scanned2023-08-27T11:53:13+00:00
URL https://holity.com/robots.txt
Redirect https://www.holity.com/robots.txt
Redirect Domain www.holity.com
Redirect Base holity.com
Domain IPs 104.26.14.232, 104.26.15.232, 172.67.72.91, 2606:4700:20::681a:ee8, 2606:4700:20::681a:fe8, 2606:4700:20::ac43:485b
Redirect IPs 104.26.14.232, 104.26.15.232, 172.67.72.91, 2606:4700:20::681a:ee8, 2606:4700:20::681a:fe8, 2606:4700:20::ac43:485b
Response IP 104.26.15.232
Found Yes
Hash 6d5bdae6728a554c68939e01263ebb503f28d216f2e1e8cddf1afe4d8b4370e6
SimHash 091cb9165ed9

Groups

*

Rule Path
Allow /*?p=
Allow /catalog/seo_sitemap/category/
Allow /catalogsearch/result/
Disallow /cron.sh
Disallow /install.php
Disallow /*?p=*&
Disallow /*?SID=
Disallow /*?limit=all
Disallow /*?dir=*&
Disallow /*?price=
Disallow /*?limit=
Disallow /*?cat=

baiduspider

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

obot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /
Disallow /privacy.html
Disallow */wp/wp-admin
Disallow */wp/wp-includes
Disallow */wp/wp-content/

Other Records

Field Value
sitemap https://www.holity.com/sitemaps/sitemap_index.xml

Comments

  • Crawlers Setup
  • Allowable Index
  • Directories
  • Disallow: /media/
  • Paths (clean URLs)
  • Disallow: /catalog/product/view/ (eliminare perché già presente rel canonical; es: http://www.holity.com/catalog/product/view/id/8983/s/lampione-in-ferro-battuto-trattato-a-bagno-di-zincatura-h16876/category/127/)
  • Disallow: /tag/ (eliminare perché response code 404, chiedere se i tag li vuole indicizzare, in ogni caso implementare il rel canonical sul tag)
  • Files
  • Paths (no clean URLs)
  • Disallow: /*.js$
  • Disallow: /*.css$
  • Disallow: /*.php$
  • (eliminare ma prima risolvere ID 47)
  • Uncomment if you do not wish for Google to index your images
  • User-agent: Googlebot-Image
  • Disallow: /

Warnings

  • 2 invalid lines.