sice.indiana.edu
robots.txt
Robots Exclusion Standard data for sice.indiana.edu
Resource Scan
Scan Details
Site Domain | sice.indiana.edu |
Base Domain | indiana.edu |
Scan Status | Ok |
Last Scan | 2024-10-05T15:51:20+00:00 |
Next Scan | 2024-11-04T15:51:20+00:00 |
Last Scan
Scanned | 2024-10-05T15:51:20+00:00 |
URL | https://sice.indiana.edu/robots.txt |
Redirect | https://luddy.indiana.edu/robots.txt |
Redirect Domain | luddy.indiana.edu |
Redirect Base | indiana.edu |
Domain IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Redirect IPs | 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e |
Response IP | 129.79.123.142 |
Found | Yes |
Hash | abea5c96dc46c795b8c86d54c1a4aa64ce7a1a7d40395636de701869124f6491 |
SimHash | 4f01a9f0fb9f |
Groups
*
Rule | Path |
---|---|
Disallow | /_documentation/ |
Disallow | /_examples/ |
Disallow | /_feeds/ |
Disallow | /_home/ |
Disallow | /_includes/ |
Disallow | /_internal/ |
Disallow | /_php/ |
Disallow | /_shared-sections/ |
Disallow | /chunks/ |
Disallow | /error/ |
Disallow | /gwassets/ |
Disallow | /machform/ |
Disallow | /_assets/ |
Disallow | /search/index.html |
Disallow | /search/index.htm |
Disallow | /search/index.shtml |
Other Records
Field | Value |
---|---|
sitemap | https://luddy.indiana.edu/sitemap.xml |