hexa.cc
robots.txt

Robots Exclusion Standard data for hexa.cc

Resource Scan

Scan Details

Site Domain hexa.cc
Base Domain hexa.cc
Scan Status Ok
Last Scan2025-07-14T21:37:07+00:00
Next Scan 2025-08-13T21:37:07+00:00

Last Scan

Scanned2025-07-14T21:37:07+00:00
URL https://www.hexa.cc/robots.txt
Redirect https://www.hexa.com/robots.txt
Redirect Domain www.hexa.com
Redirect Base hexa.com
Domain IPs 2001:4b98:e01::38, 217.70.184.56
Redirect IPs 198.202.211.1, 2620:cb:2000::1
Response IP 198.202.211.1
Found Yes
Hash a04d6eb96c689b69c25b062ddbda5abc35f9250afd47f8dab7cfd922502918f9
SimHash 707aed678697

Groups

*

Rule Path
Disallow */jobs?
Disallow */companies/placeholder
Disallow */companies/hexa
Disallow */companies/efounders
Disallow */companies/3founders
Disallow */companies/logicfounders
Disallow *?search=
Disallow *compagnies?studio
Disallow *blog?
Disallow *fintech?
Disallow *news?
Disallow *startups?
Disallow *future-of-work?
Disallow *insights?
Disallow *latest-articles?
Disallow *web3?
Disallow *ref?

Other Records

Field Value
sitemap https://www.hexa.com/sitemap.xml
sitemap https://www.hexa.com/sitemap.xml