portal.research.lu.se
robots.txt

Robots Exclusion Standard data for portal.research.lu.se

Resource Scan

Scan Details

Site Domain portal.research.lu.se
Base Domain lu.se
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-03-25T08:23:16+00:00
Next Scan 2025-05-24T08:23:16+00:00

Last Successful Scan

Scanned2025-01-02T07:33:41+00:00
URL https://portal.research.lu.se/robots.txt
Domain IPs 104.18.39.240, 172.64.148.16
Response IP 172.64.148.16
Found Yes
Hash a68dd05d279c96cedbdc1518637d729fa721cba357e26376e8d98428922638b1
SimHash eb1d5830a173

Groups

*

Rule Path
Disallow /*?*format=rss
Disallow /*?*export=xls

Other Records

Field Value
crawl-delay 5

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://portal.research.lu.se/sitemap.xml