arch-no.org
robots.txt

Robots Exclusion Standard data for arch-no.org

Resource Scan

Scan Details

Site Domain arch-no.org
Base Domain arch-no.org
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2026-03-16T17:38:17+00:00
Next Scan 2026-05-15T17:38:17+00:00

Last Successful Scan

Scanned2023-02-08T01:10:45+00:00
URL https://arch-no.org/robots.txt
Domain IPs 35.171.57.87, 52.21.5.176
Response IP 52.21.5.176
Found Yes
Hash c180a7675b0fb1df39fd8a4a555cbb8009f37e12a9cea4bf4414e5407938df22
SimHash 634cd851c391

Groups

mj12bot

Rule Path
Disallow /

*

Rule Path
Disallow
Allow /

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://nolacatholic.org/sitemap16596.xml