halian.com
robots.txt

Robots Exclusion Standard data for halian.com

Resource Scan

Scan Details

Site Domain halian.com
Base Domain halian.com
Scan Status Ok
Last Scan2026-03-05T07:36:45+00:00
Next Scan 2026-04-04T07:36:45+00:00

Last Scan

Scanned2026-03-05T07:36:45+00:00
URL https://halian.com/robots.txt
Redirect https://www.halian.com/robots.txt
Redirect Domain www.halian.com
Redirect Base halian.com
Domain IPs 104.21.39.172, 172.67.147.99, 2606:4700:3033::6815:27ac, 2606:4700:3037::ac43:9363
Redirect IPs 104.21.39.172, 172.67.147.99, 2606:4700:3033::6815:27ac, 2606:4700:3037::ac43:9363
Response IP 172.67.147.99
Found Yes
Hash fe9f988434893341401321d01e19e767042876d6d36dd683c20a6022b606fe93
SimHash a964d0b127b0

Groups

*

Rule Path
Disallow /hs/
Disallow /hs-scripts/
Disallow /hs-web-interactives/
Disallow /hs-search
Disallow /hs-fs/
Disallow /hs/cta/
Disallow /404
Disallow /500
Disallow /*?hsDebug=
Disallow /*?hsCacheBuster=
Disallow /*?hs_amp=true
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsLang=*
Allow /hubfs/
Disallow /*?*hsCacheBuster=*

Other Records

Field Value
sitemap https://www.halian.com/sitemap.xml

Comments

  • Disallow HubSpot system directories
  • Disallow error pages
  • Disallow query parameters that cause duplicate content
  • Allow HubSpot file system (images, PDFs, etc.)