iu.edu
robots.txt

Robots Exclusion Standard data for iu.edu

Resource Scan

Scan Details

Site Domain iu.edu
Base Domain iu.edu
Scan Status Ok
Last Scan2024-10-28T16:37:07+00:00
Next Scan 2024-11-27T16:37:07+00:00

Last Scan

Scanned2024-10-28T16:37:07+00:00
URL https://iu.edu/robots.txt
Redirect https://www.iu.edu/robots.txt
Redirect Domain www.iu.edu
Redirect Base iu.edu
Domain IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Redirect IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Response IP 129.79.123.143
Found Yes
Hash de218db4a4732a35192ecfbf27607a95acefe136c9b8cb2d80348e45c31e0af3
SimHash 6961ead06b8e

Groups

adsbot-google-mobile

Rule Path
Allow /campaigns/

adsbot-google

Rule Path
Allow /campaigns/

*

Rule Path
Disallow /president/communications/vip-updates/index.html
Disallow /tomorrow/
Disallow /_archive/
Disallow /_css/
Disallow /_dev/
Disallow /_includes/
Disallow /_internal/
Disallow /_js/
Disallow /_links/
Disallow /_php/
Disallow /_shared/
Disallow /error/
Disallow /gwassets/
Disallow /machform/
Disallow /mobile/
Disallow /search/index.html
Disallow /search/index.htm
Disallow /search/index.shtml

Other Records

Field Value
sitemap https://www.iu.edu/sitemap.xml