SUNScholar/Repository Website Metrics
Jump to navigation
Jump to search
Contents
Google Settings
Register with the major harvesters
Suggestions from the webometrics ranking editors
See: http://repositories.webometrics.info/en/Best_Practices
All the scientific production, formal and informal, draft or definitive, published or unpublished, should be available from a unique web site. The institutional repository is a very important asset of the institution as a whole, not only of the library. We recommend the following syntax for the institutional repository web address:
http://repository.university.country
- It is very important to avoid changing the institutional domain as it can generate confusion and it has a devastating effect on the visibility values.
- Avoid cumbersome navigation menus based on Flash, Java or JavaScript that can block the robot access.
- For scientists it is important that the link to the full text would be easily citable.
- Therefore Very Long URLs should be avoided in all situations.
Good examples of repositories with friendly persistent URL's as per webometrics best practices
- http://scholar.sun.ac.za
- http://repository.up.ac.za
- http://repository.uwc.ac.za
- http://repository.unam.na
- http://uir.unisa.ac.za
- http://ir.dut.ac.za
- http://ir.polytechnic.edu.na
- http://dar.aucegypt.edu
- http://www.ubrisa.ub.bw
Robots
See below for an example robots.txt file.
User-agent: * # Disable access to Discovery search and filters Disallow: /discover Disallow: /search-filter # This should be the FULL URL to your HTML Sitemap. # Make sure to replace "[dspace.url]" with the value of your 'dspace.url' setting in your dspace.cfg file. Sitemap: http://[dspace.url]/htmlmap # If you have configured DSpace (Solr-based) Statistics to be publicly accessible, # then you likely do not want this content to be indexed # Disallow: /displaystats # Uncomment the following line ONLY if sitemaps.org or HTML sitemaps are used # and you have verified that your site is being indexed correctly. # Disallow: /browse # You also may wish to disallow access to the following paths, in order # to stop web spiders from accessing user-based content: # Disallow: /advanced-search # Disallow: /contact # Disallow: /feedback # Disallow: /forgot # Disallow: /login # Disallow: /register # Disallow: /search
HTML Metadata
If you have heavily customized your metadata fields away from Dublin Core, you can modify the crosswalk that generates these elements by modifying:
[dspace]/config/crosswalks/xhtml-head-item.properties.
If you have heavily customized your metadata fields, or wish to change the default "mappings" to these Highwire Press tags, they are configurable in:
[dspace]/config/crosswalks/google-metadata.properties
Directories
- http://repositories.webometrics.info
- http://roar.eprints.org
- http://www.opendoar.org
- http://www.arwu.org
- http://www.webometrics.info
- http://en.wikipedia.org/wiki/College_and_university_rankings
- http://www.topuniversities.com/world-university-rankings
References
- https://wiki.duraspace.org/pages/viewpage.action?pageId=34642415
- https://wiki.duraspace.org/display/DSDOC18/Configuration#Configuration-SitemapSettings
- https://wiki.duraspace.org/display/DSDOC17/Configuration#Configuration-SitemapSettings
- http://www.dspace.org/1_6_2Documentation/ch05.html#N142ED
- http://www.dspace.org/1_5_2Documentation/ch03.html#N10B41
Back to Web Analytics