canvas.upenn.edu
robots.txt

Robots Exclusion Standard data for canvas.upenn.edu

Resource Scan

Scan Details

Site Domain canvas.upenn.edu
Base Domain upenn.edu
Scan Status Ok
Last Scan 2025-02-28T01:02:59+00:00
Next Scan 2025-03-14T01:02:59+00:00

Last Scan

Scanned 2025-02-28T01:02:59+00:00
URL https://canvas.upenn.edu/robots.txt
Domain IPs 3.215.14.220, 44.216.224.159, 44.223.185.240
Response IP 44.223.185.240
Found Yes
Hash d3356aecd32de23a8490ad22b9438c442bb445458c8b1f7304f1c8108ec8d422
SimHash b28d298d6470

Groups

*

Rule Path
Disallow /page_views/
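
To illustrate what the single rule in the `*` group means for a crawler, here is a minimal sketch using Python's standard `urllib.robotparser`. The two robots.txt lines are reconstructed from the group listed above and are not a verbatim copy of the file. The helper name `is_allowed`, the user agent `ExampleBot`, and the sample URLs are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in this report (an assumption:
# the real file also contains comments that are not reproduced here).
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /page_views/",
]


def is_allowed(url: str, user_agent: str = "ExampleBot") -> bool:
    """Return True if the reconstructed rules permit fetching the URL."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)  # parse in-memory lines, no network access
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Hypothetical paths: one under the disallowed prefix, one outside it.
    for sample in (
        "https://canvas.upenn.edu/page_views/123",
        "https://canvas.upenn.edu/courses",
    ):
        print(sample, is_allowed(sample))
```

Because the rule is a path prefix, only URLs whose path begins with `/page_views/` are excluded; every other path on the site remains open to any user agent under these rules.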

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines: