iu.edu
robots.txt
Robots Exclusion Standard data for iu.edu
Resource Scan
Scan Details
Site Domain | iu.edu |
Base Domain | iu.edu |
Scan Status | Ok |
Last Scan | 2024-09-28T16:36:44+00:00 |
Next Scan | 2024-10-28T16:36:44+00:00 |
Last Scan
Scanned | 2024-09-28T16:36:44+00:00 |
URL | https://iu.edu/robots.txt |
Redirect | https://www.iu.edu/robots.txt |
Redirect Domain | www.iu.edu |
Redirect Base | iu.edu |
Domain IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Redirect IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Response IP | 129.79.123.142 |
Found | Yes |
Hash | 4a985ec8bc05ce2766f4d69478e553728deb6d007c419807cf790b12dcc680c8 |
SimHash | 7961ebd44b8c |
Groups
*
Rule | Path |
---|---|
Disallow | /tomorrow/ |
Disallow | /_archive/ |
Disallow | /_css/ |
Disallow | /_dev/ |
Disallow | /_includes/ |
Disallow | /_internal/ |
Disallow | /_js/ |
Disallow | /_links/ |
Disallow | /_php/ |
Disallow | /_shared/ |
Disallow | /error/ |
Disallow | /gwassets/ |
Disallow | /machform/ |
Disallow | /mobile/ |
Disallow | /search/index.html |
Disallow | /search/index.htm |
Disallow | /search/index.shtml |
Other Records
Field | Value |
---|---|
sitemap | https://www.iu.edu/sitemap.xml |