clarionherald.org
robots.txt

Robots Exclusion Standard data for clarionherald.org

Resource Scan

Scan Details

Site Domain clarionherald.org
Base Domain clarionherald.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-18T04:27:09+00:00
Next Scan 2024-11-17T04:27:09+00:00

Last Successful Scan

Scanned2023-02-03T22:14:37+00:00
URL https://clarionherald.org/robots.txt
Domain IPs 35.171.57.87, 52.21.5.176
Response IP 52.21.5.176
Found Yes
Hash a664731054aa2fd7d4d76f6ef992258273cd470d34f3626422791883617aacb3
SimHash 6154d8524731

Groups

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow
Allow /

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://clarionherald.org/sitemap22596.xml