www.research.ed.ac.uk
robots.txt

Robots Exclusion Standard data for www.research.ed.ac.uk

Resource Scan

Scan Details

Site Domain www.research.ed.ac.uk
Base Domain ed.ac.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan3/25/2025, 11:53:14 AM
Next Scan 5/24/2025, 11:53:14 AM

Last Successful Scan

Scanned1/2/2025, 11:51:30 AM
URL https://www.research.ed.ac.uk/robots.txt
Domain IPs 104.18.39.240, 172.64.148.16
Response IP 104.18.39.240
Found Yes
Hash b28960571e235bb98550514284e24ae2a96b78097c7f4cbf64b2fcca292f6597
SimHash 6d3d5860e133

Groups

*

Rule Path
Disallow /*?*format=rss
Disallow /*?*export=xls

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.research.ed.ac.uk/sitemap.xml