towson.edu
robots.txt

Robots Exclusion Standard data for towson.edu

Resource Scan

Scan Details

Site Domain towson.edu
Base Domain towson.edu
Scan Status Ok
Last Scan2025-12-24T11:31:39+00:00
Next Scan 2026-01-23T11:31:39+00:00

Last Scan

Scanned2025-12-24T11:31:39+00:00
URL https://towson.edu/robots.txt
Redirect https://www.towson.edu/robots.txt
Redirect Domain www.towson.edu
Redirect Base towson.edu
Domain IPs 52.224.91.77
Redirect IPs 52.224.91.77
Response IP 52.224.91.77
Found Yes
Hash 078bd2301ac3798e49240f6b4949af8e26414a09bdc7f904d60662e65e1ac1bd
SimHash 1c51b30b4bdc

Groups

*

Rule Path
Disallow /_dev/
Disallow /_training/
Disallow /_testingtemplates/
Disallow /_oneofftesting/
Disallow /_outesting/
Disallow /_resources/_htmls
Disallow /_resources/backups
Disallow /_resources/gadgets
Disallow /_resources/ou
Disallow /_resources/scripts
Disallow /_resources/snippets
Disallow /_resources/tuenduserguide.pdf
Disallow /_resources/xmldata
Disallow /_resources/xsl
Disallow /_resources/xsl-1
Disallow /_resources/dmc
Disallow /campaign/
Disallow /news/archive.html?tag*
Disallow /news/archive.html?type*

adsbot-google

Rule Path
Allow /campaign/

archive.org_bot

Rule Path
Allow /_resources/css/
Allow /_resources/includes/
Allow /_resources/images/
Allow /_resources/js/
Allow /_resources/scripts/
Allow /_resources/dmc/