smuc.ac.uk
robots.txt

Robots Exclusion Standard data for smuc.ac.uk

Resource Scan

Scan Details

Site Domain smuc.ac.uk
Base Domain smuc.ac.uk
Scan Status Ok
Last Scan2024-08-31T10:21:24+00:00
Next Scan 2024-09-30T10:21:24+00:00

Last Scan

Scanned2024-08-31T10:21:24+00:00
URL https://www.smuc.ac.uk/robots.txt
Redirect https://www.stmarys.ac.uk/robots.txt
Redirect Domain www.stmarys.ac.uk
Redirect Base stmarys.ac.uk
Domain IPs 185.18.139.29
Redirect IPs 185.18.139.29
Response IP 185.18.139.29
Found Yes
Hash 983ee48e79c12080d765e074a52df433d5645fc4d5b25d28a502ffa7b03bb3c7
SimHash 093aca9a4373

Groups

*

Rule Path
Disallow /forms/
Disallow /site-elements/img
Disallow /search-results.aspx
Disallow /oauth-callback.aspx
Disallow /vrview/
Disallow /links/
Disallow /media/
Disallow /site-info/error-page.aspx
Disallow /test-blog/
Disallow /chat/
Disallow /test/
Disallow /announcements/
Disallow /teacher-training/handbook/
Disallow /outreach/
Disallow /your-offer/
Disallow /undergraduate-degree/
Disallow /welcome/course/
Disallow /staff-directory/staff-profile-upload.aspx
Disallow /sitemaps/sitemap-generator.aspx
Disallow /*/pdf$
Disallow /application-process/docs/specs/
Disallow /staff/
Disallow /students/
Disallow /announcements/

ninjabot

Rule Path
Allow /