dspace.unive.it
robots.txt

Robots Exclusion Standard data for dspace.unive.it

Resource Scan

Scan Details

Site Domain dspace.unive.it
Base Domain unive.it
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-06-01T13:18:21+00:00
Next Scan 2025-07-31T13:18:21+00:00

Last Successful Scan

Scanned2025-03-11T13:16:38+00:00
URL http://dspace.unive.it/robots.txt
Domain IPs 157.138.7.91
Response IP 157.138.7.91
Found Yes
Hash d64ef5155ff990167b82280716e8387cbf529024f3b3bd5d7686d93e77fe0f83
SimHash f87d1595254c

Groups

*

Rule Path
Disallow /advanced-search
Disallow /contact
Disallow /feedback
Disallow /forgot
Disallow /login
Disallow /register
Disallow /search

Comments

  • ====
  • The contents of this file are subject to the license and copyright
  • detailed in the LICENSE and NOTICE files at the root of the source
  • tree and available online at
  • http://www.dspace.org/license/
  • ====
  • Uncomment the following line ONLY if sitemaps.org or HTML sitemaps are used
  • and you have verified that your site is being indexed correctly.
  • Disallow: /browse
  • You also may wish to disallow access to the following paths, in order
  • to stop web spiders from accessing user-based content: