getharvest.com
robots.txt
Robots Exclusion Standard data for getharvest.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | getharvest.com |
Base Domain | getharvest.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-06-19T02:28:13+00:00 |
Next Scan | 2024-07-03T02:28:13+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2023-08-07T21:37:20+00:00 |
URL | https://getharvest.com/robots.txt |
Domain IPs | 199.60.103.164, 199.60.103.64 |
Response IP | 199.60.103.64 |
Found | Yes |
Hash | a130de74268a745c42dfb68a4c816e354e50230d2cd7486857b3976998f221e4 |
SimHash | b84ddee00fd2 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /invitation/ |
Disallow | /generate/ |
Disallow | /_hcms/preview/ |
Disallow | /hs/manage-preferences/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.getharvest.com/sitemap.xml.gz |
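
The records above can be checked against sample paths with Python's standard `urllib.robotparser`. The following is a minimal sketch, not the site's actual file: `ROBOTS_TXT` is a reconstruction from the rules and sitemap listed in the last successful scan (its exact layout, ordering, and any comments are assumptions), and the helper names `build_parser` and `is_allowed`, the `ExampleBot` user agent, and the sample paths are hypothetical. Because the most recent scan failed, the live file may differ.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan records; the real file's formatting is an assumption.
ROBOTS_TXT = """\
User-agent: *
Disallow: /invitation/
Disallow: /generate/
Disallow: /_hcms/preview/
Disallow: /hs/manage-preferences/

Sitemap: https://www.getharvest.com/sitemap.xml.gz
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text locally, without any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(parser: RobotFileParser, path: str, agent: str = "ExampleBot") -> bool:
    """Return True if the (hypothetical) agent may fetch the given path."""
    return parser.can_fetch(agent, "https://getharvest.com" + path)


parser = build_parser(ROBOTS_TXT)

# Paths under a Disallow prefix are blocked for every agent in the "*" group.
print(is_allowed(parser, "/invitation/abc"))
# Paths that match no Disallow rule are allowed by default.
print(is_allowed(parser, "/features"))
# Sitemap records are exposed separately (Python 3.8+).
print(parser.site_maps())
```

Matching in `urllib.robotparser` is by simple path prefix, so `/invitation/abc` is blocked while `/invitation` (no trailing slash) is not.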