hull.ac.uk
robots.txt

Robots Exclusion Standard data for hull.ac.uk

Resource Scan

Scan Details

Site Domain hull.ac.uk
Base Domain hull.ac.uk
Scan Status Ok
Last Scan2025-07-24T16:47:38+00:00
Next Scan 2025-08-23T16:47:38+00:00

Last Scan

Scanned2025-07-24T16:47:38+00:00
URL https://www.hull.ac.uk/robots.txt
Domain IPs 185.18.139.8
Response IP 185.18.139.8
Found Yes
Hash 273931f48d9b6bf1392013adb5896b4907948d324be219e2d73827d9e5f54951
SimHash 0a2998424d43

Groups

*

Rule Path
Allow /
Allow /editor-assets/images/*
Allow /editor-assets/docs/*
Allow /assets/editor/image-library/*
Allow */images/*
Allow */img/*
Allow */docs/*
Allow *.css
Allow *.js
Disallow /*?*
Disallow /search
Disallow /search/*
Disallow /search/*?*
Disallow /special/thank-you
Disallow /special/thank-you-*
Disallow /assets/*
Disallow /beta/*
Disallow /ucas/*
Disallow /site-elements/img/test-images/*
Disallow /website-assets/*
Disallow /test/*
Disallow /app_code/
Disallow /image-library/*

googlebot

Rule Path
Allow /assets/developer/css/*.css

Other Records

Field Value
sitemap https://www.hull.ac.uk/sitemap.xml
sitemap https://www.hull.ac.uk/page-sitemap.xml
sitemap https://www.hull.ac.uk/video-sitemap.xml