www.hou.usra.edu
robots.txt

Robots Exclusion Standard data for www.hou.usra.edu

Resource Scan

Scan Details

Site Domain www.hou.usra.edu
Base Domain usra.edu
Scan Status Ok
Last Scan2025-11-20T07:57:34+00:00
Next Scan 2025-12-20T07:57:34+00:00

Last Scan

Scanned2025-11-20T07:57:34+00:00
URL https://www.hou.usra.edu/robots.txt
Domain IPs 104.20.25.112, 172.66.155.178, 2606:4700:10::6814:1970, 2606:4700:10::ac42:9bb2
Response IP 172.66.155.178
Found Yes
Hash 91cef19549c101d78163a535d7d1fd64c7fa376f8a8bf77350ccc9aa9b3107be
SimHash 1e32c3278ff2

Groups

amazonbot

Rule Path
Disallow /meetings/lpsc2020/eposterindex.cfm
Disallow /meetings/lpsc2020/eposter/
Disallow /meetings/lpsc2020/pdf/
Disallow /meetings/*/eposterindex.cfm

*

Rule Path
Disallow *.zip$
Disallow */private/
Disallow */internal/
Disallow /meetings/*/eposterindex.cfm/

Comments

  • robots.txt for http://www.hou.usra.edu/