siu.edu
robots.txt

Robots Exclusion Standard data for siu.edu

Resource Scan

Scan Details

Site Domain siu.edu
Base Domain siu.edu
Scan Status Ok
Last Scan2024-10-26T04:37:24+00:00
Next Scan 2024-11-25T04:37:24+00:00

Last Scan

Scanned2024-10-26T04:37:24+00:00
URL https://siu.edu/robots.txt
Domain IPs 131.230.21.130
Response IP 131.230.21.130
Found Yes
Hash 485b447246dc397374784d548fc29d8181974e5e225586dfdc166831a0efd58b
SimHash 5a4cef716dbe

Groups

*

Rule Path
Allow /

*

Rule Path
Disallow /landing/
Disallow /Connections/
Disallow /_assets/images/css-images/
Disallow /plesk-stat/
Disallow /policies/
Disallow /test/
Disallow /WEB-INF/
Disallow /~adulted
Disallow /~africa
Disallow /~armyrotc
Disallow /~as
Disallow /~asaocap
Disallow /~delta
Disallow /~deweyctr
Disallow /~epse1
Disallow /~fao
Disallow /~fsolt
Disallow /~humres
Disallow /~jadams
Disallow /~mhebel
Disallow /~narijibon
Disallow /~pbgc/
Disallow /~protocell
Disallow /~pulfrich
Disallow /~rtate
Disallow /~siupress
Disallow /~wed08

googlebot-image

Rule Path
Allow /