Difference between revisions of "SUNScholar/Media Filters/4.X"

From Libopedia
Jump to navigation Jump to search
 
(46 intermediate revisions by the same user not shown)
Line 1: Line 1:
 
<center>
 
<center>
  '''[[SUNScholar/Media Filters|Back to Media Filters]]'''
+
  '''[[SUNScholar/Media_Filters/Thumbnails|Back to Thumbnails]]'''
 
</center>
 
</center>
==Requirements==
+
==<font color="red">'''PLEASE NOTE:'''</font>==
Check the following and then return.
+
After a while we noticed our server load increasing radically when doing the nightly media-filter jobs. We isolated the problem to the "Branded Preview JPEG" filter. This filter has been disabled as these branded previews are not important to us.
http://wiki.lib.sun.ac.za/index.php/SUNScholar/Install_DSpace/S03#Step_3.2
 
  
==Step 1 - Login to the server==
+
==Step 1 - Install the Ubuntu software packages==
http://wiki.lib.sun.ac.za/index.php/SUNScholar/Prepare_Ubuntu/S01
 
 
 
==Step 2 - Install the Ubuntu software packages==
 
 
Type the following:
 
Type the following:
 
  sudo apt-get install xpdf poppler-utils curl
 
  sudo apt-get install xpdf poppler-utils curl
  
==Step 3 - Install the java packages==
+
==Step 2 - Install the java packages==
===Step 3A - Install "jai_imageio.jar"===
+
 
 +
===Step 2A - Install "jai_imageio.jar"===
 
  mkdir $HOME/temp  
 
  mkdir $HOME/temp  
  
 
  cd $HOME/temp
 
  cd $HOME/temp
  
  curl -O http://download.java.net/media/jai-imageio/builds/release/1.1/jai_imageio-1_1-lib-linux-i586.tar.gz
+
  wget --no-check-certificate http://download.java.net/media/jai-imageio/builds/release/1.1/jai_imageio-1_1-lib-linux-i586.tar.gz
  
 
  tar -xzvf jai_imageio-1_1-lib-linux-i586.tar.gz
 
  tar -xzvf jai_imageio-1_1-lib-linux-i586.tar.gz
Line 33: Line 30:
 
</pre>
 
</pre>
  
===Step 3B - Install "jai_core.jar"===
+
===Step 2B - Install "jai_core.jar"===
 
  mkdir $HOME/temp  
 
  mkdir $HOME/temp  
  
Line 50: Line 47:
 
</pre>
 
</pre>
  
==Step 4 - Configuration==
+
==Step 3 - Configuration==
===Step 4A===
+
===Step 3A===
First, be sure there is a value for ''thumbnail.maxwidth'' and that it corresponds to the size you want for preview images for the UI.
+
Edit the ''"dspace.cfg"'' file.
 +
nano $HOME/{{Source}}/dspace/config/dspace.cfg
 +
 
 +
First enable thumbnails, search for following and change to true:
 +
webui.browse.thumbnail.show = true
 +
 
 +
webui.item.thumbnail.show = true
 +
 
 +
webui.preview.enabled = true
 +
 
 +
Then, search for the following and change as needed:
 +
webui.preview.brand = My Institution Name
 +
 
 +
webui.preview.brand.abbrev = MyOrg
 +
 
 +
Lastly, be sure there is a value for ''thumbnail.maxwidth'' and that it corresponds to the size you want for preview images for the UI.
  
Edit the ''"dspace.cfg"'' file.
 
nano /home/dspace/source/config/dspace.cfg
 
 
Search for the following and modify.
 
Search for the following and modify.
 
<pre>
 
<pre>
Line 63: Line 73:
 
</pre>
 
</pre>
  
===Step 4B===
+
===Step 3B===
 
Search for "filter.plugins" and replace with the following.
 
Search for "filter.plugins" and replace with the following.
 
<pre>
 
<pre>
Line 76: Line 86:
 
</pre>
 
</pre>
  
===Step 4C===
+
===Step 3C===
 
Change the MediaFilter plugin configuration to remove the old ''"org.dspace.app.mediafilter.PDFFilter"'' and add the new filters ''"org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor"'' and ''"org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail"''. Replace with the following.
 
Change the MediaFilter plugin configuration to remove the old ''"org.dspace.app.mediafilter.PDFFilter"'' and add the new filters ''"org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor"'' and ''"org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail"''. Replace with the following.
  
Line 82: Line 92:
 
plugin.named.org.dspace.app.mediafilter.FormatFilter = \
 
plugin.named.org.dspace.app.mediafilter.FormatFilter = \
 
   org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor, \
 
   org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor, \
  org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail, \
 
 
   org.dspace.app.mediafilter.HTMLFilter = HTML Text Extractor, \
 
   org.dspace.app.mediafilter.HTMLFilter = HTML Text Extractor, \
 
   org.dspace.app.mediafilter.WordFilter = Word Text Extractor, \
 
   org.dspace.app.mediafilter.WordFilter = Word Text Extractor, \
 
   org.dspace.app.mediafilter.PowerPointFilter = PowerPoint Text Extractor, \
 
   org.dspace.app.mediafilter.PowerPointFilter = PowerPoint Text Extractor, \
 +
  org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail, \
 
   org.dspace.app.mediafilter.JPEGFilter = JPEG Thumbnail, \
 
   org.dspace.app.mediafilter.JPEGFilter = JPEG Thumbnail, \
 
   org.dspace.app.mediafilter.BrandedPreviewJPEGFilter = Branded Preview JPEG
 
   org.dspace.app.mediafilter.BrandedPreviewJPEGFilter = Branded Preview JPEG
 
</pre>
 
</pre>
  
===Step 4D===
+
===Step 3D===
 
Then replace ''"filter.org.dspace.app.mediafilter.PDFFilter.inputFormats = Adobe PDF"'' with the following:
 
Then replace ''"filter.org.dspace.app.mediafilter.PDFFilter.inputFormats = Adobe PDF"'' with the following:
 
<pre>
 
<pre>
Line 97: Line 107:
 
</pre>
 
</pre>
  
===Step 4E===
+
===Step 3E===
 
Above the comment, "#Custom settings for PDFFilter", add the following:
 
Above the comment, "#Custom settings for PDFFilter", add the following:
 
<pre>
 
<pre>
Line 107: Line 117:
  
 
==Step 4 - Build and Install==
 
==Step 4 - Build and Install==
Perform the following as the "dspace" user.
 
 
 
To build, type the following:
 
To build, type the following:
  cd /home/dspace/source
+
  cd $HOME/{{Source}}
  
 
  mvn -U clean package -Pxpdf-mediafilter-support
 
  mvn -U clean package -Pxpdf-mediafilter-support
 
To install, type the following: (Replace XXX with your DSpace version number)
 
To install, type the following: (Replace XXX with your DSpace version number)
  cd /home/dspace/source/dspace/target/dspace-XXX-build
+
  cd $HOME/{{Source}}/dspace/target/dspace-XXX-build
  
 
  ant update
 
  ant update
  
 
  ant clean_backups
 
  ant clean_backups
 +
 
==Step 5 - Update dspace rebuild script==
 
==Step 5 - Update dspace rebuild script==
 
If the test build works then add the switch"-Pxpdf-mediafilter-support" to the dspace rebuild script, so that:
 
If the test build works then add the switch"-Pxpdf-mediafilter-support" to the dspace rebuild script, so that:
Line 127: Line 136:
 
See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Rebuild_DSpace
 
See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Rebuild_DSpace
 
==Step 6 - Test the media filers==
 
==Step 6 - Test the media filers==
Type the following to test. Select an item that has pdf files attached and use it as replacement for "123456789/29097".
+
[[SUNScholar/Restart_DSpace|Restart DSpace]] and then type the following to test. Select an item that has pdf files attached and use it as replacement for "123456789/29097".
  /home/dspace/bin/dspace filter-media -n -v -i 123456789/29097
+
  $HOME/bin/dspace filter-media -n -v -i 123456789/29097
 +
 
 
==Step 7 - Create new thumbnails==
 
==Step 7 - Create new thumbnails==
Now that the tests have worked, type the following to force creation of new thumbnails for the entire system. The script is configured to do 1000 items at a time only. This saves on memory and CPU time. Therefore on a large system you may need to run the script several times. Also make sure that the dspace user has full read/write access to all items in the assetstore folders.
+
The scripts are configured to do 1000 items at a time only. This saves on memory and CPU time. Therefore on a large system you may need to run the script several times. Also make sure that the dspace user has full read/write access to all items in the assetstore folders (<tt>'''sudo chmod 0777 -R $HOME/assetstore/'''</tt>).
 +
 
 +
$HOME/bin/dspace filter-media -n -v -m 1000 -p "PDF Thumbnail"
 +
 
 +
$HOME/bin/dspace filter-media -n -v -m 1000 -p "JPEG Thumbnail"
  
/home/dspace/bin/dspace filter-media -n -v -f -m 1000 -p "PDF Thumbnail"
 
 
==Step 8 - Add a daily admin task==
 
==Step 8 - Add a daily admin task==
 
See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Daily_Admin. Check the '''"filter-media"''' options!
 
See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Daily_Admin. Check the '''"filter-media"''' options!
 +
==Step 9 - Item list preview settings==
 +
Edit the following file:
 +
nano $HOME/{{Source}}/dspace/config/dspace.cfg
 +
Search for <tt>'''xmlui.theme.mirage.item-list.emphasis'''</tt>. There are two options available namely "metadata" or "file", select "file" and save the "dspace.cfg" file, then rebuild DSpace. See example below.
 +
<pre>
 +
### Settings for Item lists in Mirage theme ###
 +
# What should the emphasis be in the display of item lists?
 +
# Possible values : 'file', 'metadata'. If your repository is
 +
# used mainly for scientific papers 'metadata' is probably the
 +
# best way. If you have a lot of images and other files 'file'
 +
# will be the best starting point
 +
# (metdata is the default value if this option is not specified)
 +
xmlui.theme.mirage.item-list.emphasis = file
 +
</pre>
  
 
==References==
 
==References==
Line 141: Line 168:
 
*http://packages.ubuntu.com/precise/xpdf
 
*http://packages.ubuntu.com/precise/xpdf
 
*https://gist.github.com/alanorth/b71a458e9b83c3a8015a
 
*https://gist.github.com/alanorth/b71a458e9b83c3a8015a
 +
[[Category:Customisation]]

Latest revision as of 15:51, 29 May 2016

Back to Thumbnails

PLEASE NOTE:

After a while we noticed our server load increasing radically when doing the nightly media-filter jobs. We isolated the problem to the "Branded Preview JPEG" filter. This filter has been disabled as these branded previews are not important to us.

Step 1 - Install the Ubuntu software packages

Type the following:

sudo apt-get install xpdf poppler-utils curl

Step 2 - Install the java packages

Step 2A - Install "jai_imageio.jar"

mkdir $HOME/temp 
cd $HOME/temp
wget --no-check-certificate http://download.java.net/media/jai-imageio/builds/release/1.1/jai_imageio-1_1-lib-linux-i586.tar.gz
tar -xzvf jai_imageio-1_1-lib-linux-i586.tar.gz
 mvn install:install-file \
                    -Dfile=jai_imageio-1_1/lib/jai_imageio.jar  \
                    -DgroupId=com.sun.media                     \
                    -DartifactId=jai_imageio                    \
                    -Dversion=1.0_01                            \
                    -Dpackaging=jar                             \
                    -DgeneratePom=true

Step 2B - Install "jai_core.jar"

mkdir $HOME/temp 
cd $HOME/temp 
wget --no-check-certificate https://m2.duraspace.org/content/repositories/thirdparty/org/fcrepo/jai_core/1.1.2_01/jai_core-1.1.2_01.jar
mvn install:install-file \
                    -Dfile=jai_core-1.1.2_01.jar  \
                    -DgroupId=javax.media                      \
                    -DartifactId=jai_core                      \
                    -Dversion=1.1.2_01                         \
                    -Dpackaging=jar                            \
                    -DgeneratePom=true

Step 3 - Configuration

Step 3A

Edit the "dspace.cfg" file.

nano $HOME/source/dspace/config/dspace.cfg

First enable thumbnails, search for following and change to true:

webui.browse.thumbnail.show = true
webui.item.thumbnail.show = true
webui.preview.enabled = true

Then, search for the following and change as needed:

webui.preview.brand = My Institution Name
webui.preview.brand.abbrev = MyOrg

Lastly, be sure there is a value for thumbnail.maxwidth and that it corresponds to the size you want for preview images for the UI.

Search for the following and modify.

# maximum width and height of generated thumbnails
thumbnail.maxwidth  = 160
thumbnail.maxheight = 160

Step 3B

Search for "filter.plugins" and replace with the following.

filter.plugins = \
        PDF Text Extractor, \
        PDF Thumbnail, \
        HTML Text Extractor, \
        Word Text Extractor, \
        PowerPoint Text Extractor, \
        JPEG Thumbnail, \
        Branded Preview JPEG

Step 3C

Change the MediaFilter plugin configuration to remove the old "org.dspace.app.mediafilter.PDFFilter" and add the new filters "org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor" and "org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail". Replace with the following.

plugin.named.org.dspace.app.mediafilter.FormatFilter = \
  org.dspace.app.mediafilter.XPDF2Text = PDF Text Extractor, \
  org.dspace.app.mediafilter.HTMLFilter = HTML Text Extractor, \
  org.dspace.app.mediafilter.WordFilter = Word Text Extractor, \
  org.dspace.app.mediafilter.PowerPointFilter = PowerPoint Text Extractor, \
  org.dspace.app.mediafilter.XPDF2Thumbnail = PDF Thumbnail, \
  org.dspace.app.mediafilter.JPEGFilter = JPEG Thumbnail, \
  org.dspace.app.mediafilter.BrandedPreviewJPEGFilter = Branded Preview JPEG

Step 3D

Then replace "filter.org.dspace.app.mediafilter.PDFFilter.inputFormats = Adobe PDF" with the following:

filter.org.dspace.app.mediafilter.XPDF2Thumbnail.inputFormats = Adobe PDF
filter.org.dspace.app.mediafilter.XPDF2Text.inputFormats = Adobe PDF

Step 3E

Above the comment, "#Custom settings for PDFFilter", add the following:

#The paths to the XPDF utilities
xpdf.path.pdftotext = /usr/bin/pdftotext
xpdf.path.pdftoppm  = /usr/bin/pdftoppm
xpdf.path.pdfinfo   = /usr/bin/pdfinfo

Step 4 - Build and Install

To build, type the following:

cd $HOME/source
mvn -U clean package -Pxpdf-mediafilter-support

To install, type the following: (Replace XXX with your DSpace version number)

cd $HOME/source/dspace/target/dspace-XXX-build
ant update
ant clean_backups

Step 5 - Update dspace rebuild script

If the test build works then add the switch"-Pxpdf-mediafilter-support" to the dspace rebuild script, so that:

mvn -U clean package

becomes

mvn -U clean package -Pxpdf-mediafilter-support

See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Rebuild_DSpace

Step 6 - Test the media filers

Restart DSpace and then type the following to test. Select an item that has pdf files attached and use it as replacement for "123456789/29097".

$HOME/bin/dspace filter-media -n -v -i 123456789/29097

Step 7 - Create new thumbnails

The scripts are configured to do 1000 items at a time only. This saves on memory and CPU time. Therefore on a large system you may need to run the script several times. Also make sure that the dspace user has full read/write access to all items in the assetstore folders (sudo chmod 0777 -R $HOME/assetstore/).

$HOME/bin/dspace filter-media -n -v -m 1000 -p "PDF Thumbnail"
$HOME/bin/dspace filter-media -n -v -m 1000 -p "JPEG Thumbnail"

Step 8 - Add a daily admin task

See: http://wiki.lib.sun.ac.za/index.php/SUNScholar/Daily_Admin. Check the "filter-media" options!

Step 9 - Item list preview settings

Edit the following file:

nano $HOME/source/dspace/config/dspace.cfg

Search for xmlui.theme.mirage.item-list.emphasis. There are two options available namely "metadata" or "file", select "file" and save the "dspace.cfg" file, then rebuild DSpace. See example below.

### Settings for Item lists in Mirage theme ###
# What should the emphasis be in the display of item lists?
# Possible values : 'file', 'metadata'. If your repository is
# used mainly for scientific papers 'metadata' is probably the
# best way. If you have a lot of images and other files 'file'
# will be the best starting point
# (metdata is the default value if this option is not specified)
xmlui.theme.mirage.item-list.emphasis = file

References