east.iu.edu
robots.txt
Robots Exclusion Standard data for east.iu.edu
Resource Scan
Scan Details
Site Domain | east.iu.edu |
Base Domain | iu.edu |
Scan Status | Ok |
Last Scan | 2024-08-31T18:09:25+00:00 |
Next Scan | 2024-09-30T18:09:25+00:00 |
Last Scan
Scanned | 2024-08-31T18:09:25+00:00 |
URL | https://east.iu.edu/robots.txt |
Domain IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Response IP | 129.79.123.142 |
Found | Yes |
Hash | b5423cde5859a165dee9b3a22a37f01c759d34dec7bec741b1ee611d0a0070f7 |
SimHash | 4f2ce8f4eb9f |
Groups
*
Rule | Path |
---|---|
Disallow | /_documentation/ |
Disallow | /_examples/ |
Disallow | /_feeds/ |
Disallow | /_home/ |
Disallow | /_includes/ |
Disallow | /_internal/ |
Disallow | /_php/ |
Disallow | /_shared-sections/ |
Disallow | /chunks/ |
Disallow | /error/ |
Disallow | /gwassets/ |
Disallow | /machform/ |
Disallow | /_assets/ |
Disallow | /search/index.html |
Disallow | /search/index.htm |
Disallow | /search/index.shtml |
Other Records
Field | Value |
---|---|
sitemap | https://east.iu.edu/sitemap.xml |