crls.501st.com
robots.txt

Robots Exclusion Standard data for crls.501st.com

Resource Scan

Scan Details

Site Domain crls.501st.com
Base Domain 501st.com
Scan Status Ok
Last Scan2025-12-08T23:10:56+00:00
Next Scan 2026-01-07T23:10:56+00:00

Last Scan

Scanned2025-12-08T23:10:56+00:00
URL https://crls.501st.com/robots.txt
Domain IPs 13.35.202.107, 13.35.202.38, 13.35.202.85, 13.35.202.92, 2600:9000:2078:1200:16:8365:c6c0:93a1, 2600:9000:2078:2800:16:8365:c6c0:93a1, 2600:9000:2078:3a00:16:8365:c6c0:93a1, 2600:9000:2078:5600:16:8365:c6c0:93a1, 2600:9000:2078:7e00:16:8365:c6c0:93a1, 2600:9000:2078:a600:16:8365:c6c0:93a1, 2600:9000:2078:e800:16:8365:c6c0:93a1, 2600:9000:2078:ea00:16:8365:c6c0:93a1
Response IP 13.35.202.85
Found Yes
Hash bfd1e8773465bc6457d1fbf60754b8007cb8026c175652ac0b8be6f2ac7afaaa
SimHash 8d3c5d22e194

Groups

*

Rule Path
Disallow /.github/
Disallow /.phan/
Disallow /assets/
Disallow /backup/
Disallow /bin/
Disallow /cache/
Disallow /logs/
Disallow /system/
Disallow /tests/
Disallow /tmp/
Disallow /user/
Disallow /vendor/
Disallow /webserver-configs/
Allow /user/pages/
Allow /user/themes/
Allow /user/images/
Allow /
Allow *.css$
Allow *.js$
Allow /system/*.js$