harvest.org
robots.txt

Robots Exclusion Standard data for harvest.org

Resource Scan

Scan Details

Site Domain harvest.org
Base Domain harvest.org
Scan Status Ok
Last Scan2025-09-20T13:37:59+00:00
Next Scan 2025-10-04T13:37:59+00:00

Last Scan

Scanned2025-09-20T13:37:59+00:00
URL https://harvest.org/robots.txt
Domain IPs 141.193.213.10, 141.193.213.11
Response IP 141.193.213.10
Found Yes
Hash ca209a61d809a7dbe5705a14fea84e6de378ca9b15659bfddc810d9c2a8dea06
SimHash c830ffaa238e

Groups

*

Rule Path
Disallow /wp-admin/
Disallow /tag
Disallow /authors
Disallow /state-province
Disallow /?s=
Disallow /search/
Disallow /feed/
Disallow */feed/
Disallow /feed/$
Disallow /comments/feed
Disallow /trackback/
Disallow */comments$
Disallow */feed
Disallow */feed$
Disallow */trackback
Disallow */trackback$
Disallow /?feed=
Disallow /wp-comments
Disallow /wp-feed
Disallow /wp-trackback
Disallow */replytocom%3D
Disallow /scripture-book/*
Allow /wp-admin/admin-ajax.php