SUNScholar/Repository Website Metrics

From Libopedia
Revision as of 05:44, 13 September 2014 by Hgibson

Google

Webometrics

Harvesters

Robots

See: https://github.com/DSpace/DSpace/pull/498

Below is an example robots.txt file, as served at http://scholar.sun.ac.za/robots.txt.

User-agent: *
# Disable access to Discovery search and filters
Disallow: /discover
Disallow: /search-filter
 
# This should be the FULL URL to your HTML Sitemap, i.e. "[dspace.url]/htmlmap",
# where "[dspace.url]" is the value of the 'dspace.url' setting in your dspace.cfg file.
Sitemap: http://scholar.sun.ac.za/htmlmap
 
# If you have configured DSpace (Solr-based) Statistics to be publicly accessible,
# then you likely do not want this content to be indexed
# Disallow: /displaystats
 
# Uncomment the following line ONLY if sitemaps.org or HTML sitemaps are used
# and you have verified that your site is being indexed correctly.
# Disallow: /browse
 
# You also may wish to disallow access to the following paths, in order
# to stop web spiders from accessing user-based content:
# Disallow: /advanced-search
# Disallow: /contact
# Disallow: /feedback
# Disallow: /forgot
# Disallow: /login
# Disallow: /register
# Disallow: /search
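One way to confirm that rules like these behave as intended is to run them through a robots.txt parser before deploying. The following is a minimal sketch using Python's standard `urllib.robotparser`. It copies only the active (uncommented) lines from the example above; the `load_rules` helper, the "Googlebot" user agent and the sample item path `/handle/10019.1/1` are assumptions made for illustration, not part of the repository's configuration.

```python
from urllib.robotparser import RobotFileParser

# Active (uncommented) lines from the example robots.txt above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /discover
Disallow: /search-filter
Sitemap: http://scholar.sun.ac.za/htmlmap
"""

BASE = "http://scholar.sun.ac.za"


def load_rules(text):
    """Parse robots.txt text into a RobotFileParser (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# "/handle/10019.1/1" is an illustrative item path, not a real record.
for path in ("/discover", "/search-filter", "/browse", "/handle/10019.1/1"):
    allowed = rules.can_fetch("Googlebot", BASE + path)
    print(path, "allowed" if allowed else "blocked")

# site_maps() requires Python 3.8 or later.
print(rules.site_maps())
```

Because the `Disallow: /browse` line is commented out in the example, the parser reports `/browse` as crawlable. Uncommenting it in `ROBOTS_TXT` and re-running the script is a quick way to preview the effect of that change.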

HTML Head Tag Metadata

If you have customized your metadata fields heavily, moving them away from Dublin Core, you can change the crosswalk that generates these head elements by editing:

[dspace]/config/crosswalks/xhtml-head-item.properties
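To illustrate what such a crosswalk does, the sketch below maps item metadata fields to HTML `<meta>` elements. It assumes mapping lines of the form `field = name[,scheme]`; check that format against your own copy of the properties file. The sample entries, the functions `parse_mapping` and `render_meta`, and the item values are hypothetical. This is not DSpace's implementation, only a model of the idea.

```python
from html import escape

# Hypothetical mapping entries; the "field = name[,scheme]" format is an assumption.
SAMPLE_MAPPING = """\
# field = meta name[,scheme]
dc.creator = DC.creator
dc.date.issued = DCTERMS.issued,DCTERMS.W3CDTF
"""


def parse_mapping(text):
    """Return {field: (meta_name, scheme_or_None)} from properties-style text."""
    mapping = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        field, _, target = line.partition("=")
        name, _, scheme = target.strip().partition(",")
        mapping[field.strip()] = (name.strip(), scheme.strip() or None)
    return mapping


def render_meta(mapping, item):
    """Render <meta> tags for the item fields that have a mapping entry."""
    tags = []
    for field, values in item.items():
        if field not in mapping:
            continue  # unmapped fields produce no head element
        name, scheme = mapping[field]
        scheme_attr = f' scheme="{escape(scheme)}"' if scheme else ""
        for value in values:
            tags.append(
                f'<meta name="{escape(name)}" content="{escape(value)}"{scheme_attr} />'
            )
    return tags


mapping = parse_mapping(SAMPLE_MAPPING)
# Illustrative item metadata; "local.note" stands in for a custom, unmapped field.
item = {"dc.creator": ["Doe, Jane"], "dc.date.issued": ["2014-09-13"], "local.note": ["x"]}
for tag in render_meta(mapping, item):
    print(tag)
```

In this model a custom field such as `local.note` yields no head element until a mapping line is added for it, which is the kind of change the properties file is meant to accommodate.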