middlesex.edu
robots.txt

Robots Exclusion Standard data for middlesex.edu

Resource Scan

Scan Details

Site Domain middlesex.edu
Base Domain middlesex.edu
Scan Status Ok
Last Scan2025-09-13T04:01:06+00:00
Next Scan 2025-10-13T04:01:06+00:00

Last Scan

Scanned2025-09-13T04:01:06+00:00
URL https://middlesex.edu/robots.txt
Domain IPs 3.209.26.193, 34.236.193.193, 52.73.2.219
Response IP 3.209.26.193
Found Yes
Hash 0b94863f262d9db4c1871c2267281bacdac53d994422472c53d4f1e0c2126786
SimHash 2810d87261f1

Groups

blackboardally

Rule Path
Allow /
Allow /*.pdf$

*

Rule Path
Disallow /_qa/
Disallow /_resources/
Disallow /_showcase/
Disallow /_training/
Disallow /cms_testing/
Disallow /directory/
Disallow /omni_cms/
Disallow /*.pdf$