belhaven.edu
robots.txt

Robots Exclusion Standard data for belhaven.edu

Resource Scan

Scan Details

Site Domain belhaven.edu
Base Domain belhaven.edu
Scan Status Ok
Last Scan2024-11-15T17:44:18+00:00
Next Scan 2024-12-15T17:44:18+00:00

Last Scan

Scanned2024-11-15T17:44:18+00:00
URL https://belhaven.edu/robots.txt
Redirect https://www.belhaven.edu/robots.txt
Redirect Domain www.belhaven.edu
Redirect Base belhaven.edu
Domain IPs 64.77.124.16
Redirect IPs 64.77.124.16
Response IP 64.77.124.16
Found Yes
Hash 48716ca51a496273bca90abb6e45d8a4f60bcd42a2732eda28c88c9c86428a87
SimHash 2aad1a1c9031

Groups

*

Rule Path
Disallow /__belhaven.edu
Disallow /1
Disallow /templates
Disallow /libraries
Disallow 404.html
Disallow e-newsletter_error.htm
Disallow e-newsletter_thank_you.htm
Disallow cancel_other.htm
Disallow cancel_studentaccount.htm
Disallow test.htm
Disallow index1.htm
Disallow index2.htm
Disallow index3.htm
Disallow index-test.htm
Disallow index-give.htm
Disallow /homepage
Disallow /ASX
Disallow /generator
Disallow /search
Disallow /vstudent
Disallow /pdfs/employment/letter-of-employment-commitment.pdf
Disallow /teenpact
Disallow /e-newsletter_thank_you.htm
Disallow /e-newsletter_error.htm
Disallow /cancel_studentaccount.htm
Disallow /cancel_other.htm
Disallow /drparrottvideo
Disallow /bookstore
Disallow /infotest
Disallow /news/articles/Duplicates
Disallow /news/ebooks
Disallow /pdfs/catalogue
Disallow /purl
Disallow /ads
Disallow /ahs
Disallow /ibc
Disallow /forms1
Disallow /_cms
Disallow /_demo
Disallow /admin
Disallow /university-life/calendar/calendar-television.html
Disallow /university-life/calendar/calendar-blazenet.html
Disallow /about/consumer-info/cip-codes.html
Disallow /about/consumer-info/data/
Disallow /about/consumer-info/professional-licensure.html

Other Records

Field Value
sitemap https://www.belhaven.edu/sitemap.xml
sitemap https://www.belhaven.edu/news/sitemap.xml
sitemap https://www.belhaven.edu/about/contact/faculty/sitemap.xml
sitemap https://www.belhaven.edu/about/contact/staff/sitemap.xml
sitemap https://www.belhaven.edu/university-life/student-profiles/sitemap.xml

Comments

  • /robots.txt as defined in
  • <http://info.webcrawler.com/mak/projects/robots/exclusion.html>