Robots Exclusion Standard data for harvesthoc.com

Resource Scan

Scan Details

Site Domain harvesthoc.com
Base Domain harvesthoc.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-12-09T07:05:52+00:00
Next Scan 2026-03-09T07:05:52+00:00

Last Successful Scan

Scanned 2024-07-14T23:19:27+00:00
URL https://harvesthoc.com/robots.txt
Domain IPs 104.26.10.53, 104.26.11.53, 172.67.74.77, 2606:4700:20::681a:a35, 2606:4700:20::681a:b35, 2606:4700:20::ac43:4a4d
Response IP 104.26.10.53
Found Yes
Hash 66d61a321d68ef0ac714833f05276c7f7048581a5c416c430db93ded659552b9
SimHash 48818503a993

Groups

*

Rule Path
Allow /wp-content/uploads/
Allow /wp-admin/admin-ajax.php
Disallow /wp-content/plugins/
Disallow /wp-admin/
Disallow /calendar-all/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/
Disallow /calendar/
Disallow /events/
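
To see how the rules in this group interact, the sketch below evaluates a path against them using the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie). This is an illustrative approximation, not the scanner's own logic: the names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, and percent-encoding normalization is deliberately left out.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/wp-content/uploads/"),
    ("allow", "/wp-admin/admin-ajax.php"),
    ("disallow", "/wp-content/plugins/"),
    ("disallow", "/wp-admin/"),
    ("disallow", "/calendar-all/"),
    ("disallow", "/*controller%3Dai1ec_exporter_controller*"),
    ("disallow", "/*/action~*/"),
    ("disallow", "/calendar/"),
    ("disallow", "/events/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumption: specificity is measured by pattern length, as in RFC 9309.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for p in ("/wp-admin/admin-ajax.php", "/wp-admin/options.php", "/events/2024/"):
        print(p, is_allowed(p))
```

Under these assumptions, `/wp-admin/admin-ajax.php` stays crawlable because its Allow pattern is longer than the `/wp-admin/` Disallow, while other paths under `/wp-admin/` are blocked.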

Other Records

Field Value
sitemap https://www.harvesthoc.com/sitemap.xml
sitemap https://www.harvesthoc.com/investor_sitemap.xml