www.library.auckland.ac.nz
robots.txt

Robots Exclusion Standard data for www.library.auckland.ac.nz

Resource Scan

Scan Details

Site Domain www.library.auckland.ac.nz
Base Domain auckland.ac.nz
Scan Status Ok
Last Scan2024-09-19T07:17:24+00:00
Next Scan 2024-10-19T07:17:24+00:00

Last Scan

Scanned2024-09-19T07:17:24+00:00
URL https://www.library.auckland.ac.nz/robots.txt
Domain IPs 130.216.156.115
Response IP 130.216.156.115
Found Yes
Hash ff6afb0ba63b8f2e7998ab366dd0e7802a60d2c19408973c60673f4333dfc0a9
SimHash 109c7da8c659

Groups

psbot

Rule Path
Disallow /

gsa-crawler+(enterprise;+m2-bna7wtakca2ja;+google_mini_admin@auckland.ac.nz)

Rule Path
Disallow /admin
Disallow /w31
Disallow /global
Disallow /tv-radio/search/
Disallow /tv-radio/title/
Disallow /tv-radio/ondemand
Disallow /tv-radio/request/media/
Disallow /tv-radio/programme/
Disallow /tv-radio/request/
Disallow /tv-radio/browse/
Disallow /exam-papers/course/
Disallow /exam-papers/subject/
Disallow /exam-papers/search/
Disallow /search/
Disallow /data/
Disallow /ereserves
Disallow /eproducts

swiftbot

Rule Path
Disallow /admin
Disallow /w31
Disallow /global
Disallow /databases
Disallow /tv-radio/search/
Disallow /tv-radio/title/
Disallow /tv-radio/ondemand
Disallow /tv-radio/request/media/
Disallow /tv-radio/programme/
Disallow /tv-radio/request/
Disallow /tv-radio/browse/
Disallow /exam-papers/course/
Disallow /exam-papers/subject/
Disallow /exam-papers/search/
Disallow /search/
Disallow /data/
Disallow /ereserves
Disallow /eproducts

*

Rule Path
Disallow /a_plus
Disallow /admin
Disallow /asklib
Disallow /booking
Disallow /docs
Disallow /data/
Disallow /edu
Disallow /faqs
Disallow /forms
Disallow /global
Disallow /w31
Disallow /slc
Disallow /ebooks
Disallow /collections
Disallow /thesis
Disallow /for
Disallow /maps
Disallow /ereserves
Disallow /eproducts
Disallow /includes/
Disallow /misc/
Disallow /modules/
Disallow /profiles/
Disallow /scripts/
Disallow /themes/
Disallow /sites/all/themes/library/js/
Disallow /CHANGELOG.txt
Disallow /cron.php
Disallow /INSTALL.mysql.txt
Disallow /INSTALL.pgsql.txt
Disallow /INSTALL.sqlite.txt
Disallow /install.php
Disallow /INSTALL.txt
Disallow /LICENSE.txt
Disallow /MAINTAINERS.txt
Disallow /update.php
Disallow /UPGRADE.txt
Disallow /xmlrpc.php
Disallow /drupal.js?lziq77
Disallow /jquery.js?v=1.4.4
Disallow /lightbox.js?1329577213
Disallow /oogleanalytics.js?lziq77
Disallow /lib.js?lziq77
Disallow /jquery.once.js?v=1.2
Disallow /jquery_pagination.js?lziq77
Disallow /admin/
Disallow /comment/reply/
Disallow /filter/tips/
Disallow /node/add/
Disallow /search/
Disallow /user/register/
Disallow /user/password/
Disallow /user/login/
Disallow /user/logout/
Disallow /?q=admin%2F
Disallow /?q=comment%2Freply%2F
Disallow /?q=filter%2Ftips%2F
Disallow /?q=node%2Fadd%2F
Disallow /?q=search%2F
Disallow /?q=user%2Fpassword%2F
Disallow /?q=user%2Fregister%2F
Disallow /?q=user%2Flogin%2F
Disallow /?q=user%2Flogout%2F
Disallow /tv-radio/search/
Disallow /tv-radio/title/
Disallow /tv-radio/ondemand
Disallow /tv-radio/programme/
Disallow /tv-radio/request/
Disallow /tv-radio/browse/
Disallow /exam-papers/course/
Disallow /exam-papers/subject/
Disallow /exam-papers/search/

Comments

  • Crawl-Delay: 5 /not recognised
  • TV & Radio
  • Exam Papers
  • TV & Radio
  • Exam Papers
  • cms excludes
  • Directories
  • Files
  • Paths (clean URLs)
  • Paths (no clean URLs)
  • TV & Radio
  • Exam Papers