library.usask.ca
robots.txt

Robots Exclusion Standard data for library.usask.ca

Resource Scan

Scan Details

Site Domain library.usask.ca
Base Domain usask.ca
Scan Status Ok
Last Scan2025-08-16T17:18:24+00:00
Next Scan 2025-09-15T17:18:24+00:00

Last Scan

Scanned2025-08-16T17:18:24+00:00
URL https://library.usask.ca/robots.txt
Domain IPs 128.233.198.202
Response IP 128.233.198.202
Found Yes
Hash 54f706ee67eec01fbe74cadd9501ff7dded0db1988769dfa321e5a68fd4a30cd
SimHash cb600c22cac8

Groups

*

Rule Path
Disallow /_php/
Disallow /cgi-bin/
Disallow /inventory/
Disallow /intranet/
Disallow /scripts/
Disallow /studyrooms/
Disallow /paws/
Disallow /web-feedback/
Disallow /authenticator.php
Disallow /debug.php
Disallow /login.php
Disallow /logout.php
Disallow /new_resources/rss.php
Disallow /search.php
Disallow /thanks.php
Disallow /studyrooms
Disallow /test/
Disallow /textbooks/solr
Disallow /gp/
Disallow /courtneymilne/islandora/search/

Other Records

Field Value
crawl-delay 10