Robots Exclusion Standard data for research.polyu.edu.hk

Resource Scan

Scan Details

Site Domain research.polyu.edu.hk
Base Domain polyu.edu.hk
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 3/25/2025, 8:24:48 PM
Next Scan 5/24/2025, 8:24:48 PM

Last Successful Scan

Scanned 1/2/2025, 8:22:42 PM
URL https://research.polyu.edu.hk/robots.txt
Domain IPs 104.18.39.240, 172.64.148.16
Response IP 172.64.148.16
Found Yes
Hash 4a7925f86f7a84252c7db695d254456fa88bc2632a6da6b89520ed316eb83735
SimHash 693c5860e133
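
The 64-hex-digit Hash value is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check between scans might look like the sketch below. The names `fingerprint` and `has_changed` are hypothetical and not part of the scanner.

```python
import hashlib

# Hash reported for the last successful scan (copied from the report above).
LAST_KNOWN_HASH = "4a7925f86f7a84252c7db695d254456fa88bc2632a6da6b89520ed316eb83735"


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str = LAST_KNOWN_HASH) -> bool:
    """Return True when the body's digest differs from the stored hash."""
    return fingerprint(body) != previous_hash.lower()
```

If the assumption holds, passing the bytes fetched from the URL above to `has_changed` would return False for as long as the file is byte-for-byte identical to the version scanned on 1/2/2025.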

Groups

*

Rule Path
Disallow /*?*format=rss
Disallow /*?*export=xls

Other Records

Field Value
crawl-delay 5
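
The two Disallow paths in the `*` group rely on the `*` wildcard, which RFC 9309 defines as matching any sequence of characters. A minimal sketch of how a crawler might evaluate them follows. The helper names and the sample paths in the usage note are hypothetical, and the matcher covers only `*` and a trailing `$`, not Allow/Disallow precedence.

```python
import re

# Disallow patterns copied from the "*" group above.
DISALLOW_PATTERNS = ["/*?*format=rss", "/*?*export=xls"]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters, a trailing '$' anchors the end,
    and everything else is a literal matched from the start of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if the path (including its query string) hits any pattern."""
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOW_PATTERNS)
```

Under this sketch, `is_disallowed("/en/publications/?format=rss")` is True because the path contains a `?` followed later by `format=rss`, while `is_disallowed("/en/publications/")` is False.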

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://research.polyu.edu.hk/sitemap.xml
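
The groups and records above can be checked with Python's standard `urllib.robotparser`. The robots.txt text below is reconstructed from this report, so its ordering, casing, and layout are assumptions rather than the file as served. Note that `urllib.robotparser` does plain prefix matching and does not interpret the `*` wildcard inside paths, so this sketch only exercises the per-agent blocks, the crawl delay, and the sitemap record.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan report; the real file may be laid out differently.
RECONSTRUCTED_ROBOTS_TXT = """\
User-agent: *
Disallow: /*?*format=rss
Disallow: /*?*export=xls
Crawl-delay: 5

User-agent: gptbot
Disallow: /

User-agent: chatgpt-user
Disallow: /

User-agent: google-extended
Disallow: /

Sitemap: https://research.polyu.edu.hk/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(RECONSTRUCTED_ROBOTS_TXT.splitlines())

# Hypothetical page URL used only for illustration.
SAMPLE_URL = "https://research.polyu.edu.hk/en/"

# The three named agents are blocked from the whole site.
blocked = {
    agent: not parser.can_fetch(agent, SAMPLE_URL)
    for agent in ("GPTBot", "ChatGPT-User", "Google-Extended")
}

# Any other agent falls back to the "*" group ("ExampleBot" is a made-up name).
other_allowed = parser.can_fetch("ExampleBot", SAMPLE_URL)
other_delay = parser.crawl_delay("ExampleBot")

# site_maps() requires Python 3.8 or later.
sitemaps = parser.site_maps()
```

Agent matching in `urllib.robotparser` is case-insensitive, so the lowercase names shown in the report match the mixed-case tokens used in the lookups.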