www.iidc.indiana.edu
robots.txt
Robots Exclusion Standard data for www.iidc.indiana.edu
Resource Scan
Scan Details
Site Domain | www.iidc.indiana.edu |
Base Domain | indiana.edu |
Scan Status | Ok |
Last Scan | 2024-09-27T20:28:58+00:00 |
Next Scan | 2024-10-27T20:28:58+00:00 |
Last Scan
Scanned | 2024-09-27T20:28:58+00:00 |
URL | https://www.iidc.indiana.edu/robots.txt |
Domain IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Response IP | 129.79.123.142 |
Found | Yes |
Hash | 253a45f385913c53e0fc8d3ea70ea8fd284ad5224875e251b4149380485fa60c |
SimHash | 6f09a9f0fbde |
Groups
*
Rule | Path |
---|---|
Disallow | /_documentation/ |
Disallow | /_examples/ |
Disallow | /_feeds/ |
Disallow | /_home/ |
Disallow | /_includes/ |
Disallow | /_internal/ |
Disallow | /_php/ |
Disallow | /_shared-sections/ |
Disallow | /chunks/ |
Disallow | /error/ |
Disallow | /gwassets/ |
Disallow | /machform/ |
Disallow | /_assets/ |
Disallow | /search/index.html |
Disallow | /search/index.htm |
Disallow | /search/index.shtml |
Other Records
Field | Value |
---|---|
sitemap | https://www.iidc.indiana.edu/sitemap.xml |