usask.ca
robots.txt
Robots Exclusion Standard data for usask.ca
Resource Scan
Scan Details
Site Domain | usask.ca |
Base Domain | usask.ca |
Scan Status | Ok |
Last Scan | 2024-10-29T21:30:55+00:00 |
Next Scan | 2024-11-28T21:30:55+00:00 |
Last Scan
Scanned | 2024-10-29T21:30:55+00:00 |
URL | https://usask.ca/robots.txt |
Redirect | https://www.usask.ca/robots.txt |
Redirect Domain | www.usask.ca |
Redirect Base | usask.ca |
Domain IPs | 128.233.195.103 |
Redirect IPs | 128.233.198.205, 2620:ae:0:1172:2840:d79f:ea47:75a4 |
Response IP | 128.233.198.205 |
Found | Yes |
Hash | 341d9820265a95a670cda0b9fab7bf12078b5ef04cfd628c0488d39ec772ed5a |
SimHash | be9d912287f2 |
Groups
*
Rule | Path | Comment |
---|---|---|
Disallow | /cgi-bin/ | includes some large virtual spaces |
Disallow | /test/ | - |
Disallow | /test.php | - |
Disallow | /_uofs-codebase/ | - |
Disallow | /_uofs-site-basic/ | - |
Disallow | /_usask/ | - |
Disallow | /arts-sandbox/ | - |
Disallow | /wcs-sandbox/ | - |
Disallow | /wcms-sandbox/ | - |
Disallow | /usaskcdn-sandbox/ | - |
Comments