luddy.indiana.edu
robots.txt

Robots Exclusion Standard data for luddy.indiana.edu

Resource Scan

Scan Details

Site Domain luddy.indiana.edu
Base Domain indiana.edu
Scan Status Ok
Last Scan2024-09-28T10:22:33+00:00
Next Scan 2024-10-28T10:22:33+00:00

Last Scan

Scanned2024-09-28T10:22:33+00:00
URL https://luddy.indiana.edu/robots.txt
Domain IPs 129.79.123.142, 129.79.123.143, 2001:18e8:2:e::11d, 2001:18e8:2:e::11e
Response IP 129.79.123.142
Found Yes
Hash abea5c96dc46c795b8c86d54c1a4aa64ce7a1a7d40395636de701869124f6491
SimHash 4f01a9f0fb9f

Groups

adsbot-google-mobile

Rule Path
Allow /campaigns/

adsbot-google

Rule Path
Allow /campaigns/

*

Rule Path
Disallow /_documentation/
Disallow /_examples/
Disallow /_feeds/
Disallow /_home/
Disallow /_includes/
Disallow /_internal/
Disallow /_php/
Disallow /_shared-sections/
Disallow /chunks/
Disallow /error/
Disallow /gwassets/
Disallow /machform/
Disallow /_assets/
Disallow /search/index.html
Disallow /search/index.htm
Disallow /search/index.shtml

Other Records

Field Value
sitemap https://luddy.indiana.edu/sitemap.xml