iuhealth.org
robots.txt

Robots Exclusion Standard data for iuhealth.org

Resource Scan

Scan Details

Site Domain iuhealth.org
Base Domain iuhealth.org
Scan Status Failed
Failure Reason Scan timed out.
Last Scan 2024-04-23T14:12:13+00:00
Next Scan 2024-07-22T14:12:13+00:00

Last Successful Scan

Scanned 2023-12-03T14:09:58+00:00
URL https://iuhealth.org/robots.txt
Domain IPs 2600:1413:b000:6::17d5:2bc4, 2600:1413:b000:6::17d5:2bd8, 96.17.96.29, 96.17.96.31
Response IP 23.44.4.129
Found Yes
Hash 755208d48a58afb2969f4aa41e1c388bf590d1f72e66bd6019536fe6c42c6fdd
SimHash b61dd38acdea

Groups

rogerbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

*

Rule Path
Disallow *api.iuhealth.org*
Disallow *stage.iuhealth.org*
Disallow /ux/*
Disallow *appointment-request-thank-you
Disallow *patient-postcard-thank-you
Disallow *patient-referral-thank-you
Disallow /thank-you
Disallow /breast-pump-thank-you
Disallow /classes-events?relatedTo=*
Disallow /find-locations/results?*
Disallow /for-media/page/*
Disallow /for-media/press-releases?location=*
Disallow /for-media/press-releases/page/*
Disallow /for-media/press-releases/archive/year/*
Disallow /thrive/category/*
Disallow /thrive/index/page/*
Disallow /thrive/index?location=*
Disallow /thrive/page/*
Disallow /thrive/tag/*
Disallow /search
Disallow /search*
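
To illustrate how the wildcard rules in the * group might be evaluated, here is a minimal sketch in Python. It assumes the common interpretation in which a rule is matched as a prefix of the URL path (including any query string), `*` stands for any run of characters, and a trailing `$` anchors the end. The function name `rule_matches` and the sample paths are hypothetical; only the rule patterns are taken from the listing above.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt rule pattern matches a URL path.

    Assumed semantics: prefix match, '*' matches any run of characters,
    and a trailing '$' anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for each '*'.
    regex = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start of the string, giving prefix semantics.
    return re.match(regex, path) is not None


# Rule patterns from the * group; the paths are made-up examples.
checks = [
    ("/thrive/tag/*", "/thrive/tag/wellness"),
    ("*appointment-request-thank-you", "/forms/appointment-request-thank-you"),
    ("/classes-events?relatedTo=*", "/classes-events?relatedTo=heart"),
    ("/search", "/search?q=flu"),
    ("/ux/*", "/find-a-doctor"),
]
for pattern, path in checks:
    print(pattern, path, rule_matches(pattern, path))
```

Under these assumed semantics, `/search` already covers everything that `/search*` does, since both behave as prefix matches. Rules are also compared against the path rather than the host name, so patterns such as `*api.iuhealth.org*` would only match paths that contain that text.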

Other Records

Field Value
sitemap https://iuhealth.org/sitemap.xml
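
As a rough check of the records above, the following sketch feeds a partial reconstruction of the file to Python's standard `urllib.robotparser`. The robots.txt text is an assumption rebuilt from this report, not the file as served. Only the plain-prefix rules `/search` and `/thank-you` are included for the * group, because this parser matches prefixes and does not interpret `*` wildcards. The user agent `ExampleBot` and the tested URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumed, partial reconstruction of the file from the scan data above.
ROBOTS_TXT = """\
User-agent: rogerbot
Disallow: /

User-agent: ahrefsbot
Disallow: /

User-agent: dotbot
Disallow: /

User-agent: *
Disallow: /thank-you
Disallow: /search

Sitemap: https://iuhealth.org/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# The three named crawlers are blocked from the whole site.
for bot in ("rogerbot", "ahrefsbot", "dotbot"):
    print(bot, parser.can_fetch(bot, "https://iuhealth.org/"))

# Any other crawler falls under the * group (ExampleBot is a made-up name).
print(parser.can_fetch("ExampleBot", "https://iuhealth.org/"))
print(parser.can_fetch("ExampleBot", "https://iuhealth.org/search"))

# site_maps() requires Python 3.8 or later.
print(parser.site_maps())
```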

Comments

  • Exclude thankyou pages from SERP - Begin
  • Exclude thankyou pages from SERP - END
  • Exclude URL params from Google Search Console - Begin
  • Exclude URL params from Google Search Console - END
  • Add in sitemap to assist search engines