uit.ac.id
robots.txt

Robots Exclusion Standard data for uit.ac.id

Resource Scan

Scan Details

Site Domain uit.ac.id
Base Domain uit.ac.id
Scan Status Ok
Last Scan2025-05-14T10:35:12+00:00
Next Scan 2025-06-13T10:35:12+00:00

Last Scan

Scanned2025-05-14T10:35:12+00:00
URL https://uit.ac.id/robots.txt
Domain IPs 104.21.90.91, 172.67.198.89, 2606:4700:3033::6815:5a5b, 2606:4700:3033::ac43:c659
Response IP 104.21.90.91
Found Yes
Hash bfd1e8773465bc6457d1fbf60754b8007cb8026c175652ac0b8be6f2ac7afaaa
SimHash 8d3c5d22e194

Groups

*

Rule Path
Disallow /.github/
Disallow /.phan/
Disallow /assets/
Disallow /backup/
Disallow /bin/
Disallow /cache/
Disallow /logs/
Disallow /system/
Disallow /tests/
Disallow /tmp/
Disallow /user/
Disallow /vendor/
Disallow /webserver-configs/
Allow /user/pages/
Allow /user/themes/
Allow /user/images/
Allow /
Allow *.css$
Allow *.js$
Allow /system/*.js$