library.usask.ca
robots.txt
Robots Exclusion Standard data for library.usask.ca
Resource Scan
Scan Details
Site Domain | library.usask.ca |
Base Domain | usask.ca |
Scan Status | Ok |
Last Scan | 2025-08-16T17:18:24+00:00 |
Next Scan | 2025-09-15T17:18:24+00:00 |
Last Scan
Scanned | 2025-08-16T17:18:24+00:00 |
URL | https://library.usask.ca/robots.txt |
Domain IPs | 128.233.198.202 |
Response IP | 128.233.198.202 |
Found | Yes |
Hash | 54f706ee67eec01fbe74cadd9501ff7dded0db1988769dfa321e5a68fd4a30cd |
SimHash | cb600c22cac8 |
Groups
*
Rule | Path |
---|---|
Disallow | /_php/ |
Disallow | /cgi-bin/ |
Disallow | /inventory/ |
Disallow | /intranet/ |
Disallow | /scripts/ |
Disallow | /studyrooms/ |
Disallow | /paws/ |
Disallow | /web-feedback/ |
Disallow | /authenticator.php |
Disallow | /debug.php |
Disallow | /login.php |
Disallow | /logout.php |
Disallow | /new_resources/rss.php |
Disallow | /search.php |
Disallow | /thanks.php |
Disallow | /studyrooms |
Disallow | /test/ |
Disallow | /textbooks/solr |
Disallow | /gp/ |
Disallow | /courtneymilne/islandora/search/ |
Other Records
Field | Value |
---|---|
crawl-delay | 10 |