soilcrust.org
robots.txt

Robots Exclusion Standard data for soilcrust.org

Resource Scan

Scan Details

Site Domain soilcrust.org
Base Domain soilcrust.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-13T23:03:52+00:00
Next Scan 2025-12-12T23:03:52+00:00

Last Successful Scan

Scanned2022-09-14T01:19:04+00:00
URL https://soilcrust.org/robots.txt
Response IP 104.21.71.91
Found Yes
Hash 025d2cdeb6d46625f81c47ac6edb4e34323164c09b997ce197f34124f425dd45
SimHash 41004c00cd92

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://soilcrust.org/sitemap_index.xml