crymca.org
robots.txt
Robots Exclusion Standard data for crymca.org
Resource Scan
Scan Details
Site Domain | crymca.org |
Base Domain | crymca.org |
Scan Status | Ok |
Last Scan | 2025-10-08T09:21:42+00:00 |
Next Scan | 2025-11-07T09:21:42+00:00 |
Last Scan
Scanned | 2025-10-08T09:21:42+00:00 |
URL | https://crymca.org/robots.txt |
Domain IPs | 104.21.59.170, 172.67.181.92, 2606:4700:3032::6815:3baa, 2606:4700:3033::ac43:b55c |
Response IP | 172.67.181.92 |
Found | Yes |
Hash | 8d5eef3860745d4a9f1e714f7a01b52aa86e8284cdee4dcc7a8142db7a55543f |
SimHash | c2350d53c5d4 |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /administrator/ |
Disallow | /bin/ |
Disallow | /cache/ |
Disallow | /cli/ |
Disallow | /components/ |
Disallow | /includes/ |
Disallow | /installation/ |
Disallow | /language/ |
Disallow | /layouts/ |
Disallow | /libraries/ |
Disallow | /logs/ |
Disallow | /modules/ |
Disallow | /plugins/ |
Disallow | /tmp/ |
Warnings
- `content-signal` is not a known field.
Comments