uth.edu
robots.txt

Robots Exclusion Standard data for uth.edu

Resource Scan

Scan Details

Site Domain uth.edu
Base Domain uth.edu
Scan Status Ok
Last Scan2024-09-20T15:04:07+00:00
Next Scan 2024-10-20T15:04:07+00:00

Last Scan

Scanned2024-09-20T15:04:07+00:00
URL https://uth.edu/robots.txt
Redirect https://www.uth.edu/robots.txt
Redirect Domain www.uth.edu
Redirect Base uth.edu
Domain IPs 129.106.32.54
Redirect IPs 104.18.10.7, 104.18.11.7, 2606:4700::6812:a07, 2606:4700::6812:b07
Response IP 104.18.11.7
Found Yes
Hash 0399fc50697efbb8c9c576c01da23849e092c7488a7e934a4e8301171164674e
SimHash 5d8b8873f513

Groups

googlebot

Rule Path
Disallow /emergency/
Disallow /hr-new/
Disallow /test-folders/
Disallow /contentAsset/
Disallow /dotAsset/
Disallow /age-inclusively/
Disallow /international-affairs/document/
Disallow /it-new/

screaming frog seo spider

Rule Path
Disallow /dotAdmin/
Disallow /temp/
Disallow /test-folders/
Disallow /training/
Disallow */content-modal.htm
Disallow */studenthealth-old
Disallow /sarofim/
Disallow /sonscc/
Disallow /fact-book/archives/
Disallow /age-inclusively/
Disallow /emergency/
Disallow /academics/applicants/archived-school-catalogs.htm
Disallow /global/youtubemodal
Disallow /hr-new/
Disallow /international-affairs/document/
Disallow /it-new/

*

Rule Path
Disallow /dotAdmin/
Disallow /temp/
Disallow /test-folders/
Disallow /training/
Disallow */content-modal.htm
Disallow */studenthealth-old
Disallow /sarofim/
Disallow /hr-new/
Disallow /emergency/
Disallow /sonscc/
Disallow /fact-book/archives/
Disallow /academics/applicants/archived-school-catalogs.htm
Disallow /global/youtubemodal/
Disallow /it-new/

Other Records

Field Value
sitemap https://www.uth.edu/uth-public-sitemap.xml

Comments

  • Allow Screaming Frog crawler
  • Allow all other crawlers