strathunion.com
robots.txt

Robots Exclusion Standard data for strathunion.com

Resource Scan

Scan Details

Site Domain strathunion.com
Base Domain strathunion.com
Scan Status Ok
Last Scan2025-07-21T15:02:19+00:00
Next Scan 2025-08-20T15:02:19+00:00

Last Scan

Scanned2025-07-21T15:02:19+00:00
URL https://strathunion.com/robots.txt
Redirect https://www.strathunion.com/robots.txt
Redirect Domain www.strathunion.com
Redirect Base strathunion.com
Domain IPs 20.162.177.142
Redirect IPs 20.162.177.142
Response IP 20.162.177.142
Found Yes
Hash 699510f594e21bbd87a8660e2ca1222a866c3ce3edf2c347d3462602cb36e76d
SimHash 4c035bd28741

Groups

googlebot

Rule Path
Allow /pagestylesheet/
Allow /stylesheet/
Allow /skins/
Disallow /photos/
Disallow /advertclick/
Disallow /login/
Disallow /resourcehandler/
Disallow /edit/
Disallow /search/
Disallow /asset/
Disallow /account/
Disallow /Shibboleth.sso
Disallow /sso/

twitterbot

Rule Path
Allow /stylesheet/
Allow /asset/
Disallow /photos/
Disallow /advertclick/
Disallow /login/
Disallow /pagestylesheet/
Disallow /skins/
Disallow /resourcehandler/
Disallow /edit/
Disallow /search/
Disallow /account/
Disallow /Shibboleth.sso
Disallow /sso/

*

Rule Path
Disallow /photos/
Disallow /advertclick/
Disallow /login/
Disallow /pagestylesheet/
Disallow /stylesheet/
Disallow /skins/
Disallow /resourcehandler/
Disallow /edit/
Disallow /search/
Disallow /asset/
Disallow /account/
Disallow /Shibboleth.sso
Disallow /sso/