cmri.org
robots.txt

Robots Exclusion Standard data for cmri.org

Resource Scan

Scan Details

Site Domain cmri.org
Base Domain cmri.org
Scan Status Ok
Last Scan2025-10-31T20:25:09+00:00
Next Scan 2025-11-30T20:25:09+00:00

Last Scan

Scanned2025-10-31T20:25:09+00:00
URL https://cmri.org/robots.txt
Domain IPs 50.87.230.23
Response IP 50.87.230.23
Found Yes
Hash badab0ea0bac6a49697b0a8fe7993a46317b02f6e7855606577d28a7e2ebde7e
SimHash 4d2945c9c153

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /cgi/
Disallow /logs/
Disallow /mail/
Disallow /*blackhole
Disallow /?blackhole

Other Records

Field Value
sitemap https://cmri.org/sitemap.xml
sitemap https://cmri.org/news-sitemap.xml