keene.edu
robots.txt

Robots Exclusion Standard data for keene.edu

Resource Scan

Scan Details

Site Domain keene.edu
Base Domain keene.edu
Scan Status Ok
Last Scan2024-09-26T07:52:37+00:00
Next Scan 2024-10-03T07:52:37+00:00

Last Scan

Scanned2024-09-26T07:52:37+00:00
URL https://www.keene.edu/robots.txt
Domain IPs 50.19.103.154
Response IP 50.19.103.154
Found Yes
Hash 461ce85768f9fe904078c35337d4c2f1bc96683c3f3e58b9d17f5d239e9891f9
SimHash ed545b621e75

Groups

*

Rule Path
Allow /
Disallow /sitemedia/
Disallow /site/directories/facstaff/
Disallow /site/directories/departments/
Disallow /site/directories/students/
Disallow /site/directories/profile/
Disallow /site/directories/alumni/
Disallow /catalog/resources/people/
Disallow /documentation/
Disallow /notification/
Disallow /assets/
Disallow /hold/
Disallow /lp/
Disallow /kst/
Disallow /directories/services/
Disallow /admissions/services/
Disallow /catalog/services/
Disallow /events/services/
Disallow /administration/*/assets/
Disallow /development/*/assets/
Disallow /admissions/*/assets/
Disallow /academics/*/assets/
Disallow /parents/*/assets/
Disallow /alumni/*/assets/
Disallow /campus/*/assets/
Disallow /office/*/assets/
Disallow /arts/*/assets/
Disallow /life/*/assets/
Disallow /ksc/assets/files/*.pdf
Disallow /ksc/assets/files/*.doc
Disallow /ksc/assets/files/*.docx
Disallow /ksc/assets/files/*.xls
Disallow /ksc/assets/files/*.xlsx
Disallow /ksc/assets/files/*.ppt
Disallow /ksc/assets/files/*.pptx
Allow /assets/tools/sitemap/*
Allow /sitemedia/static/css/*
Allow /sitemedia/static/images/*
Allow /sitemedia/static/scripts/*

panscient.com

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

academicbotrtu

Rule Path
Disallow /

Comments

  • DISALLOWED DIRECTORIES
  • FILES SHOULD ONLY INDEX VIA /download/ LINKS
  • DISALLOWED EXEMPTIONS
  • BAD ROBOTS