iu.edu
robots.txt
Robots Exclusion Standard data for iu.edu
Resource Scan
Scan Details
Site Domain | iu.edu |
Base Domain | iu.edu |
Scan Status | Ok |
Last Scan | 2024-10-28T16:37:07+00:00 |
Next Scan | 2024-11-27T16:37:07+00:00 |
Last Scan
Scanned | 2024-10-28T16:37:07+00:00 |
URL | https://iu.edu/robots.txt |
Redirect | https://www.iu.edu/robots.txt |
Redirect Domain | www.iu.edu |
Redirect Base | iu.edu |
Domain IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Redirect IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Response IP | 129.79.123.143 |
Found | Yes |
Hash | de218db4a4732a35192ecfbf27607a95acefe136c9b8cb2d80348e45c31e0af3 |
SimHash | 6961ead06b8e |
Groups
*
Rule | Path |
---|---|
Disallow | /president/communications/vip-updates/index.html |
Disallow | /tomorrow/ |
Disallow | /_archive/ |
Disallow | /_css/ |
Disallow | /_dev/ |
Disallow | /_includes/ |
Disallow | /_internal/ |
Disallow | /_js/ |
Disallow | /_links/ |
Disallow | /_php/ |
Disallow | /_shared/ |
Disallow | /error/ |
Disallow | /gwassets/ |
Disallow | /machform/ |
Disallow | /mobile/ |
Disallow | /search/index.html |
Disallow | /search/index.htm |
Disallow | /search/index.shtml |
Other Records
Field | Value |
---|---|
sitemap | https://www.iu.edu/sitemap.xml |