nrcat.org
robots.txt

Robots Exclusion Standard data for nrcat.org

Resource Scan

Scan Details

Site Domain nrcat.org
Base Domain nrcat.org
Scan Status Ok
Last Scan2026-01-07T17:48:58+00:00
Next Scan 2026-02-06T17:48:58+00:00

Last Scan

Scanned2026-01-07T17:48:58+00:00
URL https://www.nrcat.org/robots.txt
Domain IPs 104.26.4.30, 104.26.5.30, 172.67.74.163, 2606:4700:20::681a:41e, 2606:4700:20::681a:51e, 2606:4700:20::ac43:4aa3
Response IP 104.26.4.30
Found Yes
Hash ebf68125e82de8dfb92cebd91d3d9019f9a4003cbc5f7e2bb196e9acd21400ac
SimHash ab2dc45469d4

Groups

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /component/taxonomy/
Disallow /includes/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/