more.lib.wi.us
robots.txt

Robots Exclusion Standard data for more.lib.wi.us

Resource Scan

Scan Details

Site Domain more.lib.wi.us
Base Domain more.lib.wi.us
Scan Status Ok
Last Scan 2024-10-25T19:50:55+00:00
Next Scan 2024-11-24T19:50:55+00:00

Last Scan

Scanned 2024-10-25T19:50:55+00:00
URL https://more.lib.wi.us/robots.txt
Domain IPs 205.213.240.40
Response IP 205.213.240.40
Found Yes
Hash e065f16069dbc56389de20d6c9bce29df22280706f56e9ad1ad0cc9cb75c0808
SimHash 25f6fd667cba
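
The report does not say how the Hash value is computed. Its 64 hexadecimal digits are consistent with a SHA-256 digest, so the following is a minimal sketch, assuming (unconfirmed) that the scanner hashes the raw response body of robots.txt with SHA-256. The function name `fingerprint` and the sample bytes are hypothetical; reproducing the value above would require the exact bytes the scanner received.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's Hash field is a SHA-256 digest of the raw
    bytes. The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file contents.
sample = b"User-agent: *\nDisallow: /tmp\n"
digest = fingerprint(sample)

# A SHA-256 hex digest always has 64 characters, like the Hash field.
print(len(digest), digest)
```

Comparing such a digest between scans is one simple way to detect that the file changed at all; the shorter SimHash field presumably serves near-duplicate detection, which this sketch does not attempt.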

Groups

*

Rule Path
Disallow /acquire
Disallow /airpac
Disallow /airwkst
Disallow /articles
Disallow /availlim
Disallow /bookill
Disallow /bookit
Disallow /circhistlim
Disallow /circpix
Disallow /cisti_order
Disallow /clearhist
Disallow /documents
Disallow /donate
Disallow /extlang
Disallow /feeds
Disallow /ftlist
Disallow /goto
Disallow /iii
Disallow /ill
Disallow /illframe
Disallow /indexsort
Disallow /journill
Disallow /kids
Disallow /launch
Disallow /logout
Disallow /manage
Disallow /manual
Disallow /metafind
Disallow /mfgo
Disallow /netli
Disallow /nonret
Disallow /patroninfo
Disallow /programs
Disallow /record%3D
Disallow /review
Disallow /screens
Disallow /search
Disallow /selfreg
Disallow /setlang
Disallow /setscope
Disallow /suggest
Disallow /tmp
Disallow /validate
Disallow /VERIFYPATRON
Disallow /VERSION
Disallow /weblang
Disallow /wm
Disallow /xrecord%3D
Disallow /z39
Disallow /z39m

googlebot-ia

Rule Path
Disallow /acquire
Disallow /airpac
Disallow /airwkst
Disallow /articles
Disallow /availlim
Disallow /bookill
Disallow /bookit
Disallow /circhistlim
Disallow /circpix
Disallow /cisti_order
Disallow /clearhist
Disallow /documents
Disallow /donate
Disallow /extlang
Disallow /feeds
Disallow /ftlist
Disallow /goto
Disallow /iii
Disallow /ill
Disallow /illframe
Disallow /indexsort
Disallow /journill
Disallow /kids
Disallow /launch
Disallow /logout
Disallow /manage
Disallow /manual
Disallow /metafind
Disallow /mfgo
Disallow /netli
Disallow /nonret
Disallow /patroninfo
Disallow /programs
Disallow /record%3D
Disallow /review
Disallow /search
Disallow /selfreg
Disallow /setlang
Disallow /setscope
Disallow /suggest
Disallow /tmp
Disallow /validate
Disallow /VERIFYPATRON
Disallow /VERSION
Disallow /weblang
Disallow /wm
Disallow /xrecord%3D
Disallow /z39
Disallow /z39m
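
The googlebot-ia listing matches the * listing except that it omits /screens. A minimal sketch using Python's standard urllib.robotparser illustrates the effect. The robots.txt text below is an abbreviated reconstruction from the listing (only three of the paths are kept), and the agent name `examplebot` and the page name under /screens are hypothetical. Real crawlers may match rules differently from this parser.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction of the two groups; the real file lists
# many more paths.
ROBOTS_TXT = """\
User-agent: *
Disallow: /screens
Disallow: /search
Disallow: /tmp

User-agent: googlebot-ia
Disallow: /search
Disallow: /tmp
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# Hypothetical URL under /screens.
url = "https://more.lib.wi.us/screens/example.html"

# A crawler with no group of its own falls back to the "*" group.
generic_allowed = parser.can_fetch("examplebot", url)

# googlebot-ia has its own group, which has no /screens rule.
scholar_allowed = parser.can_fetch("googlebot-ia", url)

# Paths listed in both groups stay blocked for googlebot-ia.
search_allowed = parser.can_fetch("googlebot-ia", "https://more.lib.wi.us/search")

print(generic_allowed, scholar_allowed, search_allowed)
```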

yandex

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

alphaseobot

Rule Path
Disallow /

alphaseobot-sa

Rule Path
Disallow /
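
Each of the groups from yandex onward carries a single `Disallow /` rule, which blocks the whole site for the named crawler. The listing shows baiduspider, baiduspider-video and baiduspider-image sharing one group; in robots.txt syntax that is normally written as consecutive User-agent lines ahead of the rule. The sketch below works under that assumption. The helper `render_group`, the agent `examplebot` and the path `/any/page` are hypothetical, and only a subset of the groups is included.

```python
from urllib.robotparser import RobotFileParser


def render_group(agents, disallowed):
    """Render one robots.txt group: User-agent lines, then Disallow rules."""
    lines = [f"User-agent: {agent}" for agent in agents]
    lines += [f"Disallow: {path}" for path in disallowed]
    return "\n".join(lines)


# Groups taken from the listing (a subset, for brevity).
groups = [
    (["yandex"], ["/"]),
    (["baiduspider", "baiduspider-video", "baiduspider-image"], ["/"]),
    (["ahrefsbot"], ["/"]),
]
robots_txt = "\n\n".join(render_group(a, d) for a, d in groups)

parser = RobotFileParser()
parser.parse(robots_txt.splitlines())

# Every agent named in a blanket-disallow group is blocked from any path.
blocked = {
    agent: not parser.can_fetch(agent, "/any/page")
    for agents, _ in groups
    for agent in agents
}
print(blocked)

# This subset has no "*" group, so an unlisted crawler is allowed here.
unlisted_allowed = parser.can_fetch("examplebot", "/any/page")
print(unlisted_allowed)
```

In the full file an unlisted crawler would instead fall under the * group and its path-specific rules.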

Comments

  • This file instructs all WWW robots NOT to index pages that begin with the URLS listed.
  • For the WebBridge Google Scholar Extension. Allows googlebot_IA to crawl /screens