indiana.edu
robots.txt
Robots Exclusion Standard data for indiana.edu
Resource Scan
Scan Details
Site Domain | indiana.edu |
Base Domain | indiana.edu |
Scan Status | Ok |
Last Scan | 2024-11-04T14:58:51+00:00 |
Next Scan | 2024-12-04T14:58:51+00:00 |
Last Scan
Scanned | 2024-11-04T14:58:51+00:00 |
URL | https://indiana.edu/robots.txt |
Redirect | https://bloomington.iu.edu/robots.txt |
Redirect Domain | bloomington.iu.edu |
Redirect Base | iu.edu |
Domain IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Redirect IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Response IP | 129.79.123.143 |
Found | Yes |
Hash | 949908acb768f849b704c3faaf076f9b81fcb5af22bdce9774ae567177da0385 |
SimHash | 4f0da9d47bdd |
Groups
*
Rule | Path |
---|---|
Disallow | /_documentation/ |
Disallow | /_examples/ |
Disallow | /_feeds/ |
Disallow | /_home/ |
Disallow | /_includes/ |
Disallow | /_internal/ |
Disallow | /_php/ |
Disallow | /_shared-sections/ |
Disallow | /chunks/ |
Disallow | /error/ |
Disallow | /gwassets/ |
Disallow | /machform/ |
Disallow | /thank-you/ |
Disallow | /_assets/ |
Disallow | /campaigns/ |
Disallow | /search/index.html |
Disallow | /search/index.htm |
Disallow | /search/index.shtml |
Other Records
Field | Value |
---|---|
sitemap | https://bloomington.iu.edu/sitemap.xml |