heloix.com
robots.txt

Robots Exclusion Standard data for heloix.com

Resource Scan

Scan Details

Site Domain heloix.com
Base Domain heloix.com
Scan Status Ok
Last Scan2025-10-31T21:35:23+00:00
Next Scan 2025-11-30T21:35:23+00:00

Last Scan

Scanned2025-10-31T21:35:23+00:00
URL https://heloix.com/robots.txt
Domain IPs 104.21.21.82, 172.67.197.14, 2606:4700:3033::6815:1552, 2606:4700:3037::ac43:c50e
Response IP 104.21.21.82
Found Yes
Hash 4eed055f0fe34cfa97762d5381fcc4caea36fbcd4a8b22f2c988b6425b6d9bed
SimHash 24d95d706fe5

Groups

*

Rule Path
Allow /
Allow /shop
Allow /product/
Allow /category/
Allow /blog/
Allow /bundles
Allow /compare
Allow /about
Allow /contact
Allow /support
Allow /careers
Allow /press
Allow /security
Allow /compare-alternatives
Allow /resources
Allow /webinars
Allow /consultation
Allow /sustainability
Allow /investor-relations
Allow /roadmap
Allow /academy
Allow /community
Allow /case-studies
Allow /request-quote
Allow /roi-calculator
Allow /reseller-program
Allow /affiliate-program
Allow /privacy-policy
Disallow /admin/
Disallow /dashboard/
Disallow /api/
Disallow /_next/
Disallow /onboarding/
Disallow /cart
Disallow /checkout
Disallow /wishlist
Disallow /addons
Disallow /order-confirmation
Disallow /auth/
Disallow /500
Disallow /maintenance
Disallow /search?*
Disallow /*?utm_*
Disallow /*?ref=*
Disallow /*?source=*
Disallow /*?campaign=*
Disallow /*.json$
Disallow /*.xml$
Disallow /*.txt$
Disallow /*.log$
Disallow /*.env$
Disallow /*.config$
Allow /css/
Allow /js/
Allow /_next/static/
Allow /images/
Allow /uploads/

googlebot

Rule Path
Allow /

bingbot

Rule Path
Allow /

slurp

Rule Path
Allow /

duckduckbot

Rule Path
Allow /

baiduspider

Rule Path
Allow /

yandexbot

Rule Path
Allow /

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://heloix.com/sitemap.xml

Comments

  • Heloix Robots.txt - Generated on 2025-10-31T21:35:25.648Z
  • Website: https://heloix.com
  • This file tells search engine crawlers which URLs they can access on this site.
  • For more information about robots.txt, visit: https://www.robotstxt.org/
  • Global rules for all user agents
  • Allow crawling of main content areas
  • Disallow admin and dashboard areas
  • Disallow user-specific pages
  • Disallow utility and maintenance pages
  • Disallow search and dynamic pages with parameters
  • Disallow file types that shouldn't be indexed
  • Allow CSS and JS for better rendering in search results
  • Specific rules for different bots
  • Block known bad bots and scrapers
  • Block AI training bots (optional - you may want to allow these)
  • Crawl delay for respectful crawling (in seconds)
  • Sitemap locations
  • Host directive (helps with canonicalization)
  • Additional notes:
  • - This robots.txt follows best practices for SaaS/e-commerce platforms
  • - Admin and user-specific areas are blocked to protect privacy
  • - Static assets are allowed for better search result rendering
  • - Known scraping bots are blocked to preserve server resources
  • - Crawl delay is set to be respectful to server resources
  • - Multiple sitemaps can be added as the site grows
  • Last updated: 2025-10-31T21:35:25.648Z

Warnings

  • `host` is not a known field.