harrowonline.org
robots.txt
Robots Exclusion Standard data for harrowonline.org
Resource Scan
Scan Details
Site Domain | harrowonline.org |
Base Domain | harrowonline.org |
Scan Status | Ok |
Last Scan | 2024-10-28T05:35:12+00:00 |
Next Scan | 2024-11-04T05:35:12+00:00 |
Last Scan
Scanned | 2024-10-28T05:35:12+00:00 |
URL | https://harrowonline.org/robots.txt |
Domain IPs | 35.214.117.240 |
Response IP | 35.214.117.240 |
Found | Yes |
Hash | d64d519504514ba70f305f5db576db3a0bd56fe838a4361eb387e2852c2d3fea |
SimHash | 4a494c582e31 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Disallow | /wp-includes/ |
Allow | /wp-includes/js/ |
Allow | /wp-content/plugins/ |
Allow | /wp-content/themes/ |
Allow | /wp-content/cache/ |
Disallow | /xmlrpc.php |
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
Other Records
Field | Value |
---|---|
sitemap | https://harrowonline.org/sitemap_index.xml |
Comments