csgsu.co.uk
robots.txt

Robots Exclusion Standard data for csgsu.co.uk

Resource Scan

Scan Details

Site Domain csgsu.co.uk
Base Domain csgsu.co.uk
Scan Status Ok
Last Scan 2025-07-01T11:53:12+00:00
Next Scan 2025-07-31T11:53:12+00:00

Last Scan

Scanned 2025-07-01T11:53:12+00:00
URL https://csgsu.co.uk/robots.txt
Redirect https://www.csgsu.co.uk/robots.txt
Redirect Domain www.csgsu.co.uk
Redirect Base csgsu.co.uk
Domain IPs 20.162.177.142
Redirect IPs 20.162.177.142
Response IP 20.162.177.142
Found Yes
Hash 699510f594e21bbd87a8660e2ca1222a866c3ce3edf2c347d3462602cb36e76d
SimHash 4c035bd28741
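
The Hash value above is 64 hexadecimal characters, which is the length of a SHA-256 digest. The scan does not name the algorithm, so treating it as SHA-256 over the raw response body is an assumption. Under that assumption, a freshly fetched copy of the file could be compared against the recorded value roughly as follows. The helper names `sha256_hex` and `matches_reported` and the `sample` body are hypothetical and used only for illustration.

```python
import hashlib

# Value copied from the "Hash" field of the scan record.
REPORTED_HASH = "699510f594e21bbd87a8660e2ca1222a866c3ce3edf2c347d3462602cb36e76d"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the raw bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported(body: bytes) -> bool:
    """Assumption: the scanner hashes the unmodified response body with SHA-256."""
    return sha256_hex(body) == REPORTED_HASH


# Hypothetical stand-in body; the real file content is not reproduced here.
sample = b"User-agent: *\nDisallow: /photos/\n"
digest = sha256_hex(sample)
print(len(digest), matches_reported(sample))
```

A mismatch does not by itself mean the file changed: the scanner may normalise line endings or hash a different representation of the response.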

Groups

googlebot

Rule Path
Allow /pagestylesheet/
Allow /stylesheet/
Allow /skins/
Disallow /photos/
Disallow /advertclick/
Disallow /login/
Disallow /resourcehandler/
Disallow /edit/
Disallow /search/
Disallow /asset/
Disallow /account/
Disallow /Shibboleth.sso
Disallow /sso/

twitterbot

Rule Path
Allow /stylesheet/
Allow /asset/
Disallow /photos/
Disallow /advertclick/
Disallow /login/
Disallow /pagestylesheet/
Disallow /skins/
Disallow /resourcehandler/
Disallow /edit/
Disallow /search/
Disallow /account/
Disallow /Shibboleth.sso
Disallow /sso/

*

Rule Path
Disallow /photos/
Disallow /advertclick/
Disallow /login/
Disallow /pagestylesheet/
Disallow /stylesheet/
Disallow /skins/
Disallow /resourcehandler/
Disallow /edit/
Disallow /search/
Disallow /asset/
Disallow /account/
Disallow /Shibboleth.sso
Disallow /sso/
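
The three groups above can be checked programmatically. The sketch below rebuilds a robots.txt from the listed rules and queries it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption: the original file's exact line order, comments, and spacing are not shown in the scan. The test paths such as `/stylesheet/main.css` and the agent name `SomeOtherBot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan's rule tables (assumed layout, not the original file).
ROBOTS_TXT = """\
User-agent: googlebot
Allow: /pagestylesheet/
Allow: /stylesheet/
Allow: /skins/
Disallow: /photos/
Disallow: /advertclick/
Disallow: /login/
Disallow: /resourcehandler/
Disallow: /edit/
Disallow: /search/
Disallow: /asset/
Disallow: /account/
Disallow: /Shibboleth.sso
Disallow: /sso/

User-agent: twitterbot
Allow: /stylesheet/
Allow: /asset/
Disallow: /photos/
Disallow: /advertclick/
Disallow: /login/
Disallow: /pagestylesheet/
Disallow: /skins/
Disallow: /resourcehandler/
Disallow: /edit/
Disallow: /search/
Disallow: /account/
Disallow: /Shibboleth.sso
Disallow: /sso/

User-agent: *
Disallow: /photos/
Disallow: /advertclick/
Disallow: /login/
Disallow: /pagestylesheet/
Disallow: /stylesheet/
Disallow: /skins/
Disallow: /resourcehandler/
Disallow: /edit/
Disallow: /search/
Disallow: /asset/
Disallow: /account/
Disallow: /Shibboleth.sso
Disallow: /sso/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(agent: str, path: str) -> bool:
    """Check a path on the redirect host against the reconstructed rules."""
    return parser.can_fetch(agent, "https://www.csgsu.co.uk" + path)


print(is_allowed("googlebot", "/stylesheet/main.css"))     # True
print(is_allowed("googlebot", "/asset/logo.png"))          # False
print(is_allowed("twitterbot", "/asset/logo.png"))         # True
print(is_allowed("SomeOtherBot", "/stylesheet/main.css"))  # False
```

The differences between groups are confined to the static-resource paths: googlebot may fetch `/pagestylesheet/`, `/stylesheet/`, and `/skins/`, twitterbot may fetch `/stylesheet/` and `/asset/`, and every other crawler is denied all four. Paths not listed in a group, such as a hypothetical `/events/`, are allowed for every agent.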