uwhealth.org
robots.txt

Robots Exclusion Standard data for uwhealth.org

Resource Scan

Scan Details

Site Domain uwhealth.org
Base Domain uwhealth.org
Scan Status Ok
Last Scan2024-05-10T20:53:03+00:00
Next Scan 2024-06-09T20:53:03+00:00

Last Scan

Scanned2024-05-10T20:53:03+00:00
URL https://uwhealth.org/robots.txt
Redirect https://www.uwhealth.org/robots.txt
Redirect Domain www.uwhealth.org
Redirect Base uwhealth.org
Domain IPs 147.75.40.150
Redirect IPs 13.215.246.123, 13.215.31.72, 2406:da18:b3d:e201::1f4, 2406:da18:b3d:e202::1f4
Response IP 13.215.31.72
Found Yes
Hash a382e06ab8a3aab17d415c20784fcb6e181850964355de51e03204f91e0f9654
SimHash bd35c09ccba0

Groups

*

Rule Path
Allow /
Disallow /ad/*
Disallow /qa_*
Disallow /*/qa_*
Disallow /components*
Disallow /*/components*
Disallow /*jsessionid*
Disallow /files/*
Disallow /machform/
Disallow /q/
Disallow /search/
Disallow /beacon-protocols/
Disallow /findadoctor/search
Disallow /*news-archive*
Disallow /news/main/10435
Disallow /news/events/25067
Disallow /news/news-from-the-uw-school-of-medicine-and-public-health/51817
Disallow /healthfacts/spanish*
Disallow /locations/kids*
Disallow /home-access/*
Disallow /our-services/center-for-wellness/class-registration-landing/46914*
Disallow /files-directory/position-descriptions/zrecruitment-information/
Disallow /files-directory/position-descriptions/1-benefit-information/
Disallow /files-directory/position-descriptions/1cloud/
Disallow /files-directory/position-descriptions/2017-benefit-information/

Other Records

Field Value
sitemap https://www.uwhealth.org/sitemap.xml

Comments

  • misc stuff
  • QA pages and url patterns
  • old blocking from original site