indiana.edu
robots.txt

Robots Exclusion Standard data for indiana.edu

Resource Scan

Scan Details

Site Domain indiana.edu
Base Domain indiana.edu
Scan Status Ok
Last Scan 2024-11-04T14:58:51+00:00
Next Scan 2024-12-04T14:58:51+00:00

Last Scan

Scanned 2024-11-04T14:58:51+00:00
URL https://indiana.edu/robots.txt
Redirect https://bloomington.iu.edu/robots.txt
Redirect Domain bloomington.iu.edu
Redirect Base iu.edu
Domain IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Redirect IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Response IP 129.79.123.143
Found Yes
Hash 949908acb768f849b704c3faaf076f9b81fcb5af22bdce9774ae567177da0385
SimHash 4f0da9d47bdd
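
The report does not say which algorithm produced the Hash value or which bytes were hashed. Its 64 hexadecimal digits are consistent with a SHA-256 digest, so the sketch below assumes SHA-256 over the raw response body; both points are assumptions, and the helper names `sha256_hex` and `matches_reported_hash` are hypothetical. It shows how a locally saved copy of the file could be compared against the reported value.

```python
import hashlib

# Hash value copied from the scan report above.
REPORTED_HASH = "949908acb768f849b704c3faaf076f9b81fcb5af22bdce9774ae567177da0385"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_reported_hash(body: bytes, reported: str = REPORTED_HASH) -> bool:
    """Compare a body's digest with the reported hash.

    Assumption: the scanner hashed the raw body with SHA-256. If it used
    another algorithm or normalized the text first, this will not match
    even for an identical file.
    """
    return sha256_hex(body) == reported.lower()
```

A mismatch is therefore inconclusive: it can mean the file changed since the scan, or simply that the assumption about the algorithm is wrong.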

Groups

adsbot-google-mobile

Rule Path
Allow /campaigns/

adsbot-google

Rule Path
Allow /campaigns/

*

Rule Path
Disallow /_documentation/
Disallow /_examples/
Disallow /_feeds/
Disallow /_home/
Disallow /_includes/
Disallow /_internal/
Disallow /_php/
Disallow /_shared-sections/
Disallow /chunks/
Disallow /error/
Disallow /gwassets/
Disallow /machform/
Disallow /thank-you/
Disallow /_assets/
Disallow /campaigns/
Disallow /search/index.html
Disallow /search/index.htm
Disallow /search/index.shtml

Other Records

Field Value
sitemap https://bloomington.iu.edu/sitemap.xml
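
The groups and records above can be checked programmatically. The sketch below rebuilds an equivalent robots.txt from the listed rules and evaluates it with Python's standard `urllib.robotparser`. The reconstructed text is an assumption (the original file's ordering, spacing, and comments are not shown in this report), and `allowed` and the `ExampleBot` agent name are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the groups listed in the report; the real file's
# exact layout is not known, so treat this text as an approximation.
ROBOTS_TXT = """\
User-agent: adsbot-google-mobile
Allow: /campaigns/

User-agent: adsbot-google
Allow: /campaigns/

User-agent: *
Disallow: /_documentation/
Disallow: /_examples/
Disallow: /_feeds/
Disallow: /_home/
Disallow: /_includes/
Disallow: /_internal/
Disallow: /_php/
Disallow: /_shared-sections/
Disallow: /chunks/
Disallow: /error/
Disallow: /gwassets/
Disallow: /machform/
Disallow: /thank-you/
Disallow: /_assets/
Disallow: /campaigns/
Disallow: /search/index.html
Disallow: /search/index.htm
Disallow: /search/index.shtml

Sitemap: https://bloomington.iu.edu/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if the parsed rules let `agent` fetch `path`."""
    return parser.can_fetch(agent, "https://bloomington.iu.edu" + path)


if __name__ == "__main__":
    for agent in ("AdsBot-Google", "ExampleBot"):
        for path in ("/campaigns/spring/", "/about/", "/search/index.html"):
            print(agent, path, allowed(agent, path))
    print(parser.site_maps())  # requires Python 3.8+
```

A crawler that matches one of the named groups is governed by that group only, which is why the AdsBot agents may fetch /campaigns/ while the `*` group blocks it for everyone else. Note that `urllib.robotparser` applies the first matching rule within a group rather than the longest match, so results can differ from other parsers on files with overlapping Allow and Disallow paths; the rules listed here do not overlap in that way.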