tilburguniversity.libcal.com
robots.txt

Robots Exclusion Standard data for tilburguniversity.libcal.com

Resource Scan

Scan Details

Site Domain tilburguniversity.libcal.com
Base Domain libcal.com
Scan Status Ok
Last Scan2025-10-15T06:32:00+00:00
Next Scan 2025-11-14T06:32:00+00:00

Last Scan

Scanned2025-10-15T06:32:00+00:00
URL https://tilburguniversity.libcal.com/robots.txt
Domain IPs 34.248.143.50, 52.208.124.55, 63.33.224.20
Response IP 34.248.143.50
Found Yes
Hash 28d53f6332336da32fee038064d9f0cd7c23437d1f067ad6123a893b0cf93258
SimHash 0084dc00e683

Groups

crawl

Rule Path
Disallow /

crawler

Rule Path
Disallow /

discobot

Rule Path
Disallow /

addthis.com

Rule Path
Disallow /

yandex

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

twitterbot

Rule Path
Disallow

gptbot

Rule Path
Disallow /

*

Rule Path
Disallow /process_

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://tilburguniversity.libcal.com/sitemap.xml