hexa.cc
robots.txt
Robots Exclusion Standard data for hexa.cc
Resource Scan
Scan Details
Site Domain | hexa.cc |
Base Domain | hexa.cc |
Scan Status | Ok |
Last Scan | 2025-07-14T21:37:07+00:00 |
Next Scan | 2025-08-13T21:37:07+00:00 |
Last Scan
Scanned | 2025-07-14T21:37:07+00:00 |
URL | https://www.hexa.cc/robots.txt |
Redirect | https://www.hexa.com/robots.txt |
Redirect Domain | www.hexa.com |
Redirect Base | hexa.com |
Domain IPs | 2001:4b98:e01::38, 217.70.184.56 |
Redirect IPs | 198.202.211.1, 2620:cb:2000::1 |
Response IP | 198.202.211.1 |
Found | Yes |
Hash | a04d6eb96c689b69c25b062ddbda5abc35f9250afd47f8dab7cfd922502918f9 |
SimHash | 707aed678697 |
Groups
*
Rule | Path |
---|---|
Disallow | */jobs? |
Disallow | */companies/placeholder |
Disallow | */companies/hexa |
Disallow | */companies/efounders |
Disallow | */companies/3founders |
Disallow | */companies/logicfounders |
Disallow | *?search= |
Disallow | *compagnies?studio |
Disallow | *blog? |
Disallow | *fintech? |
Disallow | *news? |
Disallow | *startups? |
Disallow | *future-of-work? |
Disallow | *insights? |
Disallow | *latest-articles? |
Disallow | *web3? |
Disallow | *ref? |
Other Records
Field | Value |
---|---|
sitemap | https://www.hexa.com/sitemap.xml |
sitemap | https://www.hexa.com/sitemap.xml |