iu.edu
robots.txt

Robots Exclusion Standard data for iu.edu

Resource Scan

Scan Details

Site Domain iu.edu
Base Domain iu.edu
Scan Status Ok
Last Scan2024-09-28T16:36:44+00:00
Next Scan 2024-10-28T16:36:44+00:00

Last Scan

Scanned2024-09-28T16:36:44+00:00
URL https://iu.edu/robots.txt
Redirect https://www.iu.edu/robots.txt
Redirect Domain www.iu.edu
Redirect Base iu.edu
Domain IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Redirect IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Response IP 129.79.123.142
Found Yes
Hash 4a985ec8bc05ce2766f4d69478e553728deb6d007c419807cf790b12dcc680c8
SimHash 7961ebd44b8c

Groups

adsbot-google-mobile

Rule Path
Allow /campaigns/

adsbot-google

Rule Path
Allow /campaigns/

*

Rule Path
Disallow /tomorrow/
Disallow /_archive/
Disallow /_css/
Disallow /_dev/
Disallow /_includes/
Disallow /_internal/
Disallow /_js/
Disallow /_links/
Disallow /_php/
Disallow /_shared/
Disallow /error/
Disallow /gwassets/
Disallow /machform/
Disallow /mobile/
Disallow /search/index.html
Disallow /search/index.htm
Disallow /search/index.shtml

Other Records

Field Value
sitemap https://www.iu.edu/sitemap.xml