connect.kuleuven.cloud
robots.txt

Robots Exclusion Standard data for connect.kuleuven.cloud

Resource Scan

Scan Details

Site Domain connect.kuleuven.cloud
Base Domain kuleuven.cloud
Scan Status Ok
Last Scan2025-09-30T21:01:40+00:00
Next Scan 2025-10-14T21:01:40+00:00

Last Scan

Scanned2025-09-30T21:01:40+00:00
URL https://connect.kuleuven.cloud/robots.txt
Domain IPs 104.19.244.91, 104.19.245.91, 2606:4700::6813:f45b, 2606:4700::6813:f55b
Response IP 104.19.244.91
Found Yes
Hash d204ff4e27778da95a0d26f458b48e1cce4836975bfc165bad25a69347c7d467
SimHash 2d559801e9f3

Groups

*

Rule Path
Disallow /pick-institution
Disallow /terms
Disallow /privacy-policy
Disallow /legal
Disallow /backoffice
Disallow /networks/*/recruiter/jobs

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

ut-dorkbot

Rule Path
Disallow /

ut-dorkbot/1.0

Rule Path
Disallow /

Other Records

Field Value
sitemap https://connect.kuleuven.cloud/sitemap.xml