cscanvas.rice.edu
robots.txt

Robots Exclusion Standard data for cscanvas.rice.edu

Resource Scan

Scan Details

Site Domain cscanvas.rice.edu
Base Domain rice.edu
Scan Status Ok
Last Scan 2025-09-23T10:13:45+00:00
Next Scan 2025-10-07T10:13:45+00:00

Last Scan

Scanned 2025-09-23T10:13:45+00:00
URL https://cscanvas.rice.edu/robots.txt
Domain IPs 107.22.247.25, 44.218.116.23, 52.45.98.229
Response IP 107.22.247.25
Found Yes
Hash d3356aecd32de23a8490ad22b9438c442bb445458c8b1f7304f1c8108ec8d422
SimHash b28d298d6470
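
The Hash value above is 64 hexadecimal digits, which is consistent with a SHA-256 digest, although the scan does not name the algorithm. As a minimal sketch, assuming SHA-256 computed over the raw response bytes (both the algorithm and the helper names `content_hash` and `has_changed` are assumptions for illustration), a change check between two scans could look like this:

```python
import hashlib


def content_hash(body: bytes) -> str:
    # Assumption: SHA-256 over the raw bytes; the scan report does not
    # state which algorithm or normalization it actually uses.
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    # True when the freshly fetched body hashes to a different value
    # than the hash recorded by the previous scan.
    return content_hash(body) != previous_hash


if __name__ == "__main__":
    body = b"User-agent: *\nDisallow: /page_views/\n"  # illustrative bytes only
    recorded = content_hash(body)
    print(has_changed(recorded, body))
    print(has_changed(recorded, body + b"# edited\n"))
```

An exact hash only signals that some byte changed; the shorter SimHash field is presumably meant for detecting near-duplicate content, which this sketch does not attempt.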

Groups

*

Rule Path
Disallow /page_views/
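
The group above corresponds to a two-line robots.txt record. As a minimal sketch (the reconstructed file text, the `ExampleBot` agent name, and the sample paths are assumptions for illustration, not part of the scan), Python's standard `urllib.robotparser` can show which URLs the rule blocks:

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the scanned group: one rule for all user agents.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /page_views/",
]

BASE = "https://cscanvas.rice.edu"


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    # Parse the in-memory lines instead of fetching the live file,
    # so the check is reproducible offline.
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    for path in ("/page_views/123", "/page_views", "/courses"):
        print(path, is_allowed(path))
```

The rule is a path-prefix match, so only paths that begin with `/page_views/` are disallowed; `/page_views` without the trailing slash and unrelated paths such as `/courses` remain allowed.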

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: