canvas.wheatoncollege.edu
robots.txt

Robots Exclusion Standard data for canvas.wheatoncollege.edu

Resource Scan

Scan Details

Site Domain canvas.wheatoncollege.edu
Base Domain wheatoncollege.edu
Scan Status Ok
Last Scan2025-11-04T08:42:31+00:00
Next Scan 2025-11-18T08:42:31+00:00

Last Scan

Scanned2025-11-04T08:42:31+00:00
URL https://canvas.wheatoncollege.edu/robots.txt
Domain IPs 44.237.121.94, 52.11.0.235, 52.40.42.98
Response IP 52.40.42.98
Found Yes
Hash d3356aecd32de23a8490ad22b9438c442bb445458c8b1f7304f1c8108ec8d422
SimHash b28d298d6470

Groups

*

Rule Path
Disallow /page_views/

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: