
Robots Exclusion Standard data for amherst.edu

Resource Scan

Scan Details

Site Domain amherst.edu
Base Domain amherst.edu
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-10T19:22:24+00:00
Next Scan 2024-12-09T19:22:24+00:00
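
The gap between the Last Scan and Next Scan timestamps above can be checked with a few lines of Python. This is only an arithmetic sketch on the two values shown; the scanner's actual scheduling policy is not stated in this report, and the variable names are illustrative.

```python
from datetime import datetime

# Timestamps copied from the Scan Details record (ISO 8601, UTC offset).
last_scan = datetime.fromisoformat("2024-10-10T19:22:24+00:00")
next_scan = datetime.fromisoformat("2024-12-09T19:22:24+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 60
```

For this record, the next scan is scheduled exactly 60 days after the failed one.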

Last Successful Scan

Scanned 2024-07-20T07:34:42+00:00
URL https://amherst.edu/robots.txt
Redirect https://www.amherst.edu/robots.txt
Redirect Domain www.amherst.edu
Redirect Base amherst.edu
Domain IPs 2600:9000:229f:2c00:11:97cf:6640:93a1, 2600:9000:229f:4400:11:97cf:6640:93a1, 2600:9000:229f:5400:11:97cf:6640:93a1, 2600:9000:229f:8200:11:97cf:6640:93a1, 2600:9000:229f:aa00:11:97cf:6640:93a1, 2600:9000:229f:c000:11:97cf:6640:93a1, 2600:9000:229f:d000:11:97cf:6640:93a1, 2600:9000:229f:d400:11:97cf:6640:93a1, 54.230.71.111, 54.230.71.17, 54.230.71.32, 54.230.71.92
Redirect IPs 13.224.163.102, 13.224.163.112, 13.224.163.2, 13.224.163.88
Response IP 3.165.102.126
Found Yes
Hash 4c695e720c401a0a6a9180dd7eff04a7283f0db3361a60f865dbd33bb907be60
SimHash a4908d1ba760
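
A content hash like the one above lets a scanner tell whether a robots.txt file changed between scans. The sketch below assumes SHA-256, because the 64-character hex value is consistent with that digest length; the report does not name the algorithm, and the function names are hypothetical. The SimHash field is a separate similarity fingerprint and is not covered here.

```python
import hashlib

def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (SHA-256 is an assumption)."""
    return hashlib.sha256(body).hexdigest()

def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest against the digest of freshly fetched content."""
    return content_fingerprint(body) != previous_hash

# Illustrative content, not the actual file served by the site.
sample = b"User-agent: *\nDisallow: /admin/\n"
stored = content_fingerprint(sample)
print(has_changed(stored, sample))          # False
print(has_changed(stored, sample + b"\n"))  # True
```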

Groups

*

Rule Path
Allow /core/*.css$
Allow /core/*.css?
Allow /core/*.js$
Allow /core/*.js?
Allow /core/*.gif
Allow /core/*.jpg
Allow /core/*.jpeg
Allow /core/*.png
Allow /core/*.svg
Allow /profiles/*.css$
Allow /profiles/*.css?
Allow /profiles/*.js$
Allow /profiles/*.js?
Allow /profiles/*.gif
Allow /profiles/*.jpg
Allow /profiles/*.jpeg
Allow /profiles/*.png
Allow /profiles/*.svg
Disallow /core/
Disallow /profiles/
Disallow /README.txt
Disallow /web.config
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips
Disallow /node/add/
Disallow /search/
Disallow /user/register
Disallow /user/password
Disallow /user/login
Disallow /user/logout
Disallow /media/oembed
Disallow /*/media/oembed
Disallow /index.php/admin/
Disallow /index.php/comment/reply/
Disallow /index.php/filter/tips
Disallow /index.php/node/add/
Disallow /index.php/search/
Disallow /index.php/user/password
Disallow /index.php/user/register
Disallow /index.php/user/login
Disallow /index.php/user/logout
Disallow /index.php/media/oembed
Disallow /index.php/*/media/oembed
Disallow /login/
Disallow /gsearch/
Disallow /loginpc/
Disallow /search/
Disallow /mm-browser/
Disallow /mm-auto/
Disallow /auto-facstaff/
Disallow /txtimg/
Disallow /amherstprofile/
Disallow /myamherst/
Disallow /myportal/
Disallow /A/
Disallow /B/
Disallow /C/
Disallow /D/
Disallow /E/
Disallow /F/
Disallow /G/
Disallow /H/
Disallow /I/
Disallow /J/
Disallow /K/
Disallow /L/
Disallow /M/
Disallow /N/
Disallow /O/
Disallow /P/
Disallow /Q/
Disallow /R/
Disallow /S/
Disallow /T/
Disallow /U/
Disallow /V/
Disallow /W/
Disallow /X/
Disallow /Y/
Disallow /Z/
Disallow /alumni/classpages/1990/notes/
Disallow /alumni/classpages/2008/classnotes/
Disallow /alumni/classpages/2004/classnotes/
Disallow /alumni/classpages/1976/notes/
Disallow /alumni/classpages/1968/1968notes/
Disallow /alumni/volunteers/classofficers/reunionhandbook/chairsandpresidents/
Disallow /offices/it/help/seniors/alumni/node/45938/webform-results/
Disallow /alumni/etools/forwarding/node/45938/webform-results/
Disallow /alumni/classpages/1992/classnotes1992/Spring200792/
Disallow /alumni/classpages/1992/classnotes1992/summer201092/
Disallow /media/view/120714/
Disallow /academiclife/departments/courses/0910S/ENGL/ENGL-05-0910S/Assignments/
Disallow /academiclife/departments/courses/0809S/AMST/AMST-68-0809S/Assignment3/
Disallow /academiclife/departments/courses/0809S/AMST/AMST-68-0809S/Assignment4/
Disallow /academiclife/departments/courses/0809F/ENGL/ENGL-05-0809F/assignment/
Disallow /academiclife/departments/courses/0708S/AMST/AMST-68-0708S/Assignment3/
Disallow /academiclife/departments/courses/0708S/AMST/AMST-68-0708S/Assignment4/
Disallow /academiclife/departments/courses/1011S/SOCI/SOCI-16-1011S/acstudent/
Disallow /users/T/ctarantino/uploaded-files/
Disallow /users/H/jlhostage/uploaded-files
Disallow /academiclife/departments/political_science/poliscialumni/alumniroster
Disallow /academiclife/funding/students/howard_hughes_fellowship/pdf
Disallow /alumni/classpages/1983/bookpdfs
Disallow /media/view/397252/
Disallow /users/F/kafairclough90/
Disallow /system/files/media/0687/Background%252520information%252520and%252520talking%252520points%252520for%252520volunteers.pdf
Disallow /give/annual_fund/volunteer/volunteer_alumni/tools/talkingpoints
Disallow /offices/it-old
Disallow /users/H/dbhutton/uploaded-files
Disallow /campuslife/reslife/housing/roomdraw/room_assignments
Disallow /people/facstaff/cgrobe/uploadedfiles
Disallow /media/view/385806/original/FY11VolunteerCommunictations.pdf
Disallow /media/view/385768
Disallow /media/view/385806
Disallow /users/F/pferrermedina
Disallow /system/files/styles/large/adaptive-image/private/media/0915/Copy%2520of%2520DSCF2969_edited.JPG?itok=-Myyr1Fg
Disallow /system/files/media/0915/Copy%2520of%2520DSCF2969_edited.JPG?itok=-Myyr1Fg
Disallow /users/M/nmoore18
Disallow /system/files/knoxparkapp.docx
Disallow /users/C/u5dcanagasabey
Disallow /users/K/akohbara14
Disallow /users/G/u5lgerberg
Disallow /users/A/maabiodun94
Disallow /academiclife/departments/courses/1112S/MUSI/MUSI-242-1112S/classblog
Disallow /securimage
Disallow /shibboleth.sso
Disallow /shibboleth.sso/login
Disallow /Shibboleth.sso
Disallow /Shibboleth.sso/Login
Disallow /links/
Disallow /media/view/85722/original/nelson_cv.pdf
Disallow /users/M/chmorgan62/testsubpage
Disallow /~wamh
Disallow /academiclife/global-learning/study_abroad/files
Disallow /taxonomy
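
The rules above mix Allow and Disallow entries and use the `*` wildcard and the `$` end anchor, so the order of the lines alone does not decide the outcome. The following is a minimal sketch of the longest-match evaluation described in RFC 9309, applied to a handful of the rules listed; it is not the scanner's or any particular crawler's implementation, and `pattern_to_regex` and `is_allowed` are hypothetical helper names. Python's standard `urllib.robotparser` does not interpret these wildcards, which is why a small matcher is written out here.

```python
import re

def pattern_to_regex(path_pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape the literal pieces; each '*' matches any run of characters.
    body = ".*".join(re.escape(piece) for piece in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules):
    """Longest matching pattern wins; on a tie, Allow wins; no match means allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            is_allow = kind.lower() == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed

# A subset of the rules from the * group above.
RULES = [
    ("Allow", "/core/*.css$"),
    ("Allow", "/core/*.js$"),
    ("Allow", "/core/*.js?"),
    ("Disallow", "/core/"),
    ("Disallow", "/admin/"),
    ("Disallow", "/*/media/oembed"),
    ("Disallow", "/taxonomy"),
]

print(is_allowed("/core/misc/drupal.js", RULES))  # True
print(is_allowed("/core/install.php", RULES))     # False
print(is_allowed("/en/media/oembed", RULES))      # False
print(is_allowed("/about", RULES))                # True
```

Under this reading, the Allow lines carve static assets out of the otherwise blocked /core/ and /profiles/ trees, because their patterns are longer than the bare directory prefixes.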

Other Records

Field Value
crawl-delay 10
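
A crawl-delay value asks crawlers to wait that many seconds between requests, although not every crawler honors the field. As a sketch, Python's standard `urllib.robotparser` can read it back. The sample text below is an assumption: it places the record inside the `*` group, whereas this report lists it separately under Other Records and does not show its position in the file. `crawl_delay_for` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Illustrative file; the placement of Crawl-delay in the * group is assumed.
SAMPLE = """\
User-agent: *
Crawl-delay: 10
Disallow: /admin/
"""

def crawl_delay_for(robots_text, user_agent="*"):
    """Return the crawl delay in seconds for a user agent, or None if unset."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    return parser.crawl_delay(user_agent)

delay = crawl_delay_for(SAMPLE)
print(delay)  # 10
```

A polite crawler would then sleep for `delay` seconds between successive requests to the host.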

Comments

  • robots.txt
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • This file will be ignored unless it is at the root of your host:
  • Used: http://example.com/robots.txt
  • Ignored: http://example.com/site/robots.txt
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • CSS, JS, Images
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)

Warnings

  • 2 invalid lines.