research.rug.nl
robots.txt

Robots Exclusion Standard data for research.rug.nl

Resource Scan

Scan Details

Site Domain research.rug.nl
Base Domain rug.nl
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-03-25T16:01:27+00:00
Next Scan 2025-05-24T16:01:27+00:00

Last Successful Scan

Scanned2025-01-02T16:00:24+00:00
URL https://research.rug.nl/robots.txt
Domain IPs 104.18.39.240, 172.64.148.16
Response IP 104.18.39.240
Found Yes
Hash 879532300d2ae3c37c73edfa4665a12dd9f14d745c82ca0749b3a3a480641a66
SimHash 633cd830a373

Groups

*

Rule Path
Disallow /*?*format=rss
Disallow /*?*export=xls

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://research.rug.nl/sitemap.xml