owlnest.rice.edu
robots.txt

Robots Exclusion Standard data for owlnest.rice.edu

Resource Scan

Scan Details

Site Domain owlnest.rice.edu
Base Domain rice.edu
Scan Status Ok
Last Scan2025-07-27T12:59:45+00:00
Next Scan 2025-08-10T12:59:45+00:00

Last Scan

Scanned2025-07-27T12:59:45+00:00
URL https://owlnest.rice.edu/robots.txt
Domain IPs 13.68.101.62
Response IP 13.68.101.62
Found Yes
Hash 35b1a153eb1be7755a03e431186b2732e72635b89365bd099385373affe38261
SimHash ed14dc35a95b

Groups

*

Rule Path
Disallow /notfound
Disallow /forbidden
Disallow /error
Disallow /api/
Disallow /engage/

Other Records

Field Value
sitemap https://owlnest.rice.edu/sitemap.xml