Robots Exclusion Standard data for siu.edu
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | siu.edu |
Base Domain | siu.edu |
Scan Status | Ok |
Last Scan | 2024-10-26T04:37:24+00:00 |
Next Scan | 2024-11-25T04:37:24+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-26T04:37:24+00:00 |
URL | https://siu.edu/robots.txt |
Domain IPs | 131.230.21.130 |
Response IP | 131.230.21.130 |
Found | Yes |
Hash | 485b447246dc397374784d548fc29d8181974e5e225586dfdc166831a0efd58b |
SimHash | 5a4cef716dbe |
Groups
*
Rule | Path |
---|---|
Allow | / |
*
Rule | Path |
---|---|
Disallow | /landing/ |
Disallow | /Connections/ |
Disallow | /_assets/images/css-images/ |
Disallow | /plesk-stat/ |
Disallow | /policies/ |
Disallow | /test/ |
Disallow | /WEB-INF/ |
Disallow | /~adulted |
Disallow | /~africa |
Disallow | /~armyrotc |
Disallow | /~as |
Disallow | /~asaocap |
Disallow | /~delta |
Disallow | /~deweyctr |
Disallow | /~epse1 |
Disallow | /~fao |
Disallow | /~fsolt |
Disallow | /~humres |
Disallow | /~jadams |
Disallow | /~mhebel |
Disallow | /~narijibon |
Disallow | /~pbgc/ |
Disallow | /~protocell |
Disallow | /~pulfrich |
Disallow | /~rtate |
Disallow | /~siupress |
Disallow | /~wed08 |
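
The two `*` groups above both apply to every crawler, so it may help to see how they could be evaluated together. The following is a minimal sketch, not the scanner's own logic. It assumes RFC 9309 semantics: groups for the same user agent are merged, the longest matching path wins, and Allow wins a tie. The helper name `is_allowed` and the sample paths are hypothetical. Wildcard handling (`*`, `$`) is left out because none of the listed rules use it.

```python
# Disallow paths copied from the second "*" group above.
DISALLOWED = [
    "/landing/", "/Connections/", "/_assets/images/css-images/",
    "/plesk-stat/", "/policies/", "/test/", "/WEB-INF/",
    "/~adulted", "/~africa", "/~armyrotc", "/~as", "/~asaocap",
    "/~delta", "/~deweyctr", "/~epse1", "/~fao", "/~fsolt",
    "/~humres", "/~jadams", "/~mhebel", "/~narijibon", "/~pbgc/",
    "/~protocell", "/~pulfrich", "/~rtate", "/~siupress", "/~wed08",
]

# Assumption: both "*" groups are merged into one rule list,
# so the "Allow /" rule from the first group sits beside the Disallow rules.
RULES = [("allow", "/")] + [("disallow", p) for p in DISALLOWED]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled (hypothetical helper).

    Matching is a case-sensitive prefix test. The longest matching
    rule decides; if an allow and a disallow rule tie, allow wins.
    """
    best_len = -1
    best_allow = True  # a path that matches no rule is allowed
    for kind, prefix in rules:
        if not path.startswith(prefix):
            continue
        allow = kind == "allow"
        longer = len(prefix) > best_len
        tie_prefers_allow = len(prefix) == best_len and allow
        if longer or tie_prefers_allow:
            best_len, best_allow = len(prefix), allow
    return best_allow


if __name__ == "__main__":
    # Sample paths are illustrative only, not taken from the scan.
    for sample in ["/", "/test/page.html", "/test", "/~as", "/~pbgc"]:
        print(sample, is_allowed(sample))
```

Two details of the listed rules show up under these assumptions: `/test/` and `/~pbgc/` end with a slash, so the bare paths `/test` and `/~pbgc` are not matched by them, while entries such as `/~as` have no trailing slash and therefore match any path that begins with that string.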