st-andrews.ac.uk
robots.txt

Robots Exclusion Standard data for st-andrews.ac.uk

Resource Scan

Scan Details

Site Domain st-andrews.ac.uk
Base Domain st-andrews.ac.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-10-23T21:44:57+00:00
Next Scan 2025-11-22T21:44:57+00:00

Last Successful Scan

Scanned2025-09-17T06:42:47+00:00
URL https://st-andrews.ac.uk/robots.txt
Redirect https://www.st-andrews.ac.uk/robots.txt
Redirect Domain www.st-andrews.ac.uk
Redirect Base st-andrews.ac.uk
Domain IPs 138.251.7.84
Redirect IPs 138.251.7.84
Response IP 138.251.7.84
Found Yes
Hash 6197a159bf19a6fa8c22f49d0d00397eeed95b2bfef9011d2330430bced2fc6b
SimHash b91ad86a2631

Groups

*

Rule Path
Disallow /jira/
Disallow /~www_pa/
Disallow /~psst/
Disallow /subjects/archive/
Disallow /schoolpeoplepublic/*
Disallow /*?*skin=0
Disallow /subjects/modules/search/
Disallow /subjects/modules/advisers-search/
Disallow /subjects/reqs/
Disallow /subjects/specs/
Disallow /philevents/
Disallow /php/
Disallow /s/
Disallow /s1/
Disallow /search/
Disallow /collections/
Disallow /photo-gallery/
Disallow /~awm2/

Other Records

Field Value
crawl-delay 10

petalbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

httrack

No rules defined. All paths allowed.