haltech.com
robots.txt

Robots Exclusion Standard data for haltech.com

Resource Scan

Scan Details

Site Domain haltech.com
Base Domain haltech.com
Scan Status Ok
Last Scan2025-06-18T05:45:42+00:00
Next Scan 2025-07-18T05:45:42+00:00

Last Scan

Scanned2025-06-18T05:45:42+00:00
URL https://www.haltech.com/robots.txt
Domain IPs 104.22.0.134, 104.22.1.134, 172.67.29.214, 2606:4700:10::6816:186, 2606:4700:10::6816:86, 2606:4700:10::ac43:1dd6
Response IP 104.22.0.134
Found Yes
Hash 537998dc0a5af2fc6a243cd7883866b0deb7ee068191f1ca41fcb42f15c60fff
SimHash 4276cd627e64

Groups

dotbot

Rule Path
Disallow /

googlebot-image

Rule Path
Allow /

mediapartners-google

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

*

Rule Path
Disallow /documents/
Disallow /haltech.com.au/
Disallow /help/
Disallow /images/
Disallow /shopbuttons/
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-content/
Disallow /default/catalogsearch/
Disallow /usastore/catalogsearch/
Disallow /exportstore/catalogsearch/
Disallow /Store/
Disallow /sub/
Disallow /reports/
Disallow /soft-home/
Disallow /wp-content/uploads/s-series/
Allow /wp-content/themes/haltech/assets/
Allow /wp-content/themes/haltech/online_store.css
Allow /wp-content/themes/haltech/style.css
Allow /wp-content/uploads/
Allow /wp-includes/js/
Allow /wp-includes/css/
Allow /wp-content/plugins/woocommerce/assets/css/

Comments

  • Google Image
  • Google AdSense
  • global
  • Allow the bots to crawl static assets