harvesthoc.com
robots.txt
Robots Exclusion Standard data for harvesthoc.com
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | harvesthoc.com |
| Base Domain | harvesthoc.com |
| Scan Status | Failed |
| Failure Stage | Fetching resource. |
| Failure Reason | Server returned a client error. |
| Last Scan | 2025-12-09T07:05:52+00:00 |
| Next Scan | 2026-03-09T07:05:52+00:00 |
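The gap between the two scan timestamps can be checked with a few lines of Python. This is only a sketch of the arithmetic: the report does not state a rescan policy, so the interval below is just what these two values imply, and the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Last Scan and Next Scan rows (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2025-12-09T07:05:52+00:00")
next_scan = datetime.fromisoformat("2026-03-09T07:05:52+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 90
```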
Last Successful Scan
| Field | Value |
|---|---|
| Scanned | 2024-07-14T23:19:27+00:00 |
| URL | https://harvesthoc.com/robots.txt |
| Domain IPs | 104.26.10.53, 104.26.11.53, 172.67.74.77, 2606:4700:20::681a:a35, 2606:4700:20::681a:b35, 2606:4700:20::ac43:4a4d |
| Response IP | 104.26.10.53 |
| Found | Yes |
| Hash | 66d61a321d68ef0ac714833f05276c7f7048581a5c416c430db93ded659552b9 |
| SimHash | 48818503a993 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Allow | /wp-content/uploads/ |
| Allow | /wp-admin/admin-ajax.php |
| Disallow | /wp-content/plugins/ |
| Disallow | /wp-admin/ |
| Disallow | /calendar-all/ |
| Disallow | /*controller%3Dai1ec_exporter_controller* |
| Disallow | /*/action~*/ |
| Disallow | /calendar/ |
| Disallow | /events/ |
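To see how the rules in this group interact, the following minimal sketch evaluates a path against them using the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, the sketch does no percent-encoding normalization, and it is not necessarily how this scanner or any particular crawler evaluates the file.

```python
import re

# Rules copied from the table above, in the listed order.
RULES = [
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/calendar-all/"),
    ("disallow", "/*controller%3Dai1ec_exporter_controller*"),
    ("disallow", "/*/action~*/"),
    ("disallow", "/calendar/"),
    ("disallow", "/events/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len, best_allow = -1, True  # no matching rule means allowed
    for kind, pattern in rules:
        # re.match anchors at the start of the path, as rule matching requires.
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            longer = len(pattern) > best_len
            tie_prefers_allow = len(pattern) == best_len and allow
            if longer or tie_prefers_allow:
                best_len, best_allow = len(pattern), allow
    return best_allow


for path in ["/wp-admin/admin-ajax.php", "/wp-admin/options.php",
             "/news/action~agenda/", "/about/"]:
    print(path, is_allowed(path))
```

Under these assumptions, `/wp-admin/admin-ajax.php` is allowed because its Allow rule is longer than `Disallow: /wp-admin/`, while `/wp-admin/options.php` and `/news/action~agenda/` are disallowed and `/about/` matches no rule and is allowed.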
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.harvesthoc.com/sitemap.xml |
| sitemap | https://www.harvesthoc.com/investor_sitemap.xml |
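The parsed groups and records above can be turned back into robots.txt syntax with a short script. This is a sketch only: the original file is not reproduced in this report, so comments, blank lines, and the original ordering are not recoverable, and the output should not be expected to match the recorded Hash. `GROUPS`, `SITEMAPS`, and `render_robots_txt` are illustrative names.

```python
# Parsed data copied from the Groups and Other Records tables.
GROUPS = {
    "*": [
        ("Allow", "/wp-content/uploads/"),
        ("Allow", "/wp-admin/admin-ajax.php"),
        ("Disallow", "/wp-content/plugins/"),
        ("Disallow", "/wp-admin/"),
        ("Disallow", "/calendar-all/"),
        ("Disallow", "/*controller%3Dai1ec_exporter_controller*"),
        ("Disallow", "/*/action~*/"),
        ("Disallow", "/calendar/"),
        ("Disallow", "/events/"),
    ],
}
SITEMAPS = [
    "https://www.harvesthoc.com/sitemap.xml",
    "https://www.harvesthoc.com/investor_sitemap.xml",
]


def render_robots_txt(groups, sitemaps):
    """Render groups and sitemap records as robots.txt text."""
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{rule}: {path}" for rule, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


robots_txt = render_robots_txt(GROUPS, SITEMAPS)
print(robots_txt)
```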