Difference between revisions of "SUNScholar/Repository Website Metrics"
Jump to navigation
Jump to search
Google
(→Robots) |
m (→Robots) |
||
| Line 12: | Line 12: | ||
See: https://github.com/DSpace/DSpace/pull/498 | See: https://github.com/DSpace/DSpace/pull/498 | ||
| − | See below for an example '''robots.txt''' file. | + | See below for an example '''<tt>http://scholar.sun.ac.za/robots.txt</tt>''' file. |
<pre> | <pre> | ||
User-agent: * | User-agent: * | ||
Revision as of 05:44, 13 September 2014
Back to Web Analytics
Webometrics
Harvesters
Robots
See: https://github.com/DSpace/DSpace/pull/498
See below for an example http://scholar.sun.ac.za/robots.txt file.
User-agent: * # Disable access to Discovery search and filters Disallow: /discover Disallow: /search-filter # This should be the FULL URL to your HTML Sitemap. # Make sure to replace "[dspace.url]" with the value of your 'dspace.url' setting in your dspace.cfg file. Sitemap: http://scholar.sun.ac.za/htmlmap # If you have configured DSpace (Solr-based) Statistics to be publicly accessible, # then you likely do not want this content to be indexed # Disallow: /displaystats # Uncomment the following line ONLY if sitemaps.org or HTML sitemaps are used # and you have verified that your site is being indexed correctly. # Disallow: /browse # You also may wish to disallow access to the following paths, in order # to stop web spiders from accessing user-based content: # Disallow: /advanced-search # Disallow: /contact # Disallow: /feedback # Disallow: /forgot # Disallow: /login # Disallow: /register # Disallow: /search
HTML Head Tag Metadata
If you have heavily customized your metadata fields away from Dublin Core, you can modify the crosswalk that generates these elements by modifying:
[dspace]/config/crosswalks/xhtml-head-item.properties