SUNScholar/Repository Website Metrics
Contents
Webometrics
Harvesters
Robots
See below for an example robots.txt file.
User-agent: * # Disable access to Discovery search and filters Disallow: /discover Disallow: /search-filter # This should be the FULL URL to your HTML Sitemap. # Make sure to replace "[dspace.url]" with the value of your 'dspace.url' setting in your dspace.cfg file. Sitemap: http://scholar.sun.ac.za/htmlmap # If you have configured DSpace (Solr-based) Statistics to be publicly accessible, # then you likely do not want this content to be indexed # Disallow: /displaystats # Uncomment the following line ONLY if sitemaps.org or HTML sitemaps are used # and you have verified that your site is being indexed correctly. # Disallow: /browse # You also may wish to disallow access to the following paths, in order # to stop web spiders from accessing user-based content: # Disallow: /advanced-search # Disallow: /contact # Disallow: /feedback # Disallow: /forgot # Disallow: /login # Disallow: /register # Disallow: /search
HTML Head Tag Metadata
If you have heavily customized your metadata fields away from Dublin Core, you can modify the crosswalk that generates these elements by modifying:
[dspace]/config/crosswalks/xhtml-head-item.properties
Directories
- http://repositories.webometrics.info
- http://roar.eprints.org
- http://www.opendoar.org
- http://www.arwu.org
- http://www.webometrics.info
- http://en.wikipedia.org/wiki/College_and_university_rankings
- http://www.topuniversities.com/world-university-rankings
References
- https://wiki.duraspace.org/pages/viewpage.action?pageId=34642415
- http://www.dspace.org/1_6_2Documentation/ch05.html#N142ED
- http://www.dspace.org/1_5_2Documentation/ch03.html#N10B41
Back to Web Analytics