uh.edu
robots.txt

Robots Exclusion Standard data for uh.edu

Resource Scan

Scan Details

Site Domain uh.edu
Base Domain uh.edu
Scan Status Ok
Last Scan2024-09-14T18:23:58+00:00
Next Scan 2024-10-14T18:23:58+00:00

Last Scan

Scanned2024-09-14T18:23:58+00:00
URL https://uh.edu/robots.txt
Domain IPs 129.7.97.54
Response IP 129.7.97.54
Found Yes
Hash 6f5101607cabd9b17f9bb0deca597434e6da9b472fff9937740ebf5e2eb25a2c
SimHash d9175cd2c6f8

Groups

*

Rule Path
Disallow /archives
Disallow /search/
Disallow /infotech/services/accessuh/
Disallow /~bhakta/
Disallow /academics/catalog/archive
Disallow /grad-catalog-archive
Disallow /honors/machform/machform/
Disallow /hilton-college/forms/machform/
Disallow /brand/_img/stretch1.png
Disallow /brand/_img/stretch2.png
Disallow /brand/_img/dropshadow1.png
Disallow /brand/_img/dropshadow2.png
Disallow /brand/_img/recolor1.png
Disallow /brand/_img/export2.png
Disallow /brand/_img/transparent1.png
Disallow /brand/_img/reposition2.png
Disallow /brand/_img/rotate1.png
Disallow /brand/_img/transparencylogo.png
Disallow */calendar/?*
Disallow */calendar/index.php?*
Disallow */calendar/index?*

bytespider

Rule Path
Disallow /