Difference between revisions of "SUNScholar/Digitisation"
Revision as of 09:40, 22 January 2011
Library Policy re Digitization
Unfortunately, the library does not currently have the capacity to provide digitization services. Digitization conducted by the library mainly addresses internal needs. We can, however, provide advice and some basic training, or share best practices when required.
Please proceed in one of the following ways:
- The department purchases its own scanning equipment and the necessary software, and appoints someone to do the scanning if the volume warrants it.
- The department approaches an external institution for assistance. Please see below.
Service Providers re Digitization
Centre for Business and Language Services
Helena Liebenberg
Tel.: +27 (21) 949 2736
E-mail: helena@sentrum.co.za
FirstCoast Technologies
Herman Crowther
Managing Director
Tel.: +27 (21) 425 1833/ 082 801 6209
E-mail: herman@fctec.co.za
Treventus
Objectives
The objective is to convert any material to digital format using digital equipment.
The resultant digital object must adhere to the following:
- Use an uncompressed bitstream for storage.
- Use open digital formats with no patent liability and which have open published standards.
For more information, see: http://en.wikipedia.org/wiki/Digitizing
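The "uncompressed bitstream" requirement can be checked mechanically for some formats. The sketch below is one possible check for TIFF, the archival image format named in the guidelines further down. It reads the Compression field (tag 259) from the first image file directory, where a value of 1 means no compression. The function names are illustrative and not part of any SUNScholar tool, and the sketch assumes a classic (non-BigTIFF) file whose whole content is already in memory.

```python
import struct

COMPRESSION_TAG = 259  # TIFF "Compression" field
NO_COMPRESSION = 1     # value meaning the pixel data is stored uncompressed


def tiff_compression(data: bytes) -> int:
    """Return the Compression value from the first IFD of a TIFF byte string."""
    if data[:2] == b"II":
        endian = "<"  # little-endian file
    elif data[:2] == b"MM":
        endian = ">"  # big-endian file
    else:
        raise ValueError("not a TIFF file: bad byte-order mark")
    magic, ifd_offset = struct.unpack(endian + "HI", data[2:8])
    if magic != 42:
        raise ValueError("not a TIFF file: bad magic number")
    (entry_count,) = struct.unpack(endian + "H", data[ifd_offset:ifd_offset + 2])
    for i in range(entry_count):
        start = ifd_offset + 2 + 12 * i  # each IFD entry is 12 bytes long
        tag, _field_type, _count = struct.unpack(
            endian + "HHI", data[start:start + 8]
        )
        if tag == COMPRESSION_TAG:
            # A single SHORT value occupies the first two bytes of the value field.
            (value,) = struct.unpack(endian + "H", data[start + 8:start + 10])
            return value
    return NO_COMPRESSION  # the TIFF default when the field is absent


def is_uncompressed_tiff(data: bytes) -> bool:
    """True when the first image in the TIFF is stored without compression."""
    return tiff_compression(data) == NO_COMPRESSION
```

In use, the bytes would come from something like `open(path, "rb").read()`. A multi-page TIFF has further directories that this sketch does not inspect.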
Digital Format Registry
Common Closed Digital Formats
See: http://patentabsurdity.com and http://en.swpat.org
Documents
All the Microsoft document formats are closed.
This is a huge problem for digital preservation.
- http://www.digitalpreservation.gov/formats/intro/intro.shtml
- http://en.wikipedia.org/wiki/Comparison_of_Office_Open_XML_and_OpenDocument
Multimedia
All the Microsoft media formats are closed.
This is a huge problem for digital preservation.
- Other closed media formats
- http://en.wikipedia.org/wiki/Category:Open_formats_closed_by_software_patents
- http://en.wikipedia.org/wiki/Mp3 (Lossy audio codec, many patent trolls)
- http://en.wikipedia.org/wiki/Advanced_Audio_Coding (Lossy audio codec, many patent trolls)
- http://en.wikipedia.org/wiki/Mpeg4 (Lossy video codec, many patent trolls)
- http://en.wikipedia.org/wiki/Jpeg (Lossy image codec, many patent trolls)
- http://en.wikipedia.org/wiki/Tagged_Image_File_Format (Lossless image codec, many patent trolls)
- http://en.wikipedia.org/wiki/Flash_Video (Closed format multimedia container, many patent trolls)
Multimedia
Converter software
- http://www.longtailvideo.com/support/blog/12633/an-overview-of-audio-and-video-transcoding
- http://www.nchsoftware.com/index.html
- http://www.gnomefiles.org/app.php/OggConvert
- http://www.linuxrising.org/transmageddon
Open Codecs
Audio
Video
Images
Container Formats
Documents
- http://en.wikipedia.org/wiki/The_Document_Foundation
- http://documentfreedom.org
- http://www.pdfa.org
- http://en.wikipedia.org/wiki/PDF/A
- Comments
Dear Hilton,

I would advise that you adopt open (i.e. non-proprietary) standards, as these have the best chance of remaining readable in the long-term future. Proprietary formats are dependent on the continuing existence of the firm who markets them, as well as the continued support by this firm, even if they continue to exist. This is in my opinion very risky.

For documents I am aware of an ISO standard that is targeted at archival, known as PDF/A (see www.pdfa.org).

For audio and video the situation is less developed, and there are as far as I know no standards specifically for archival. In both cases I would recommend that data be saved without lossy compression, and again that open standards be sought. Hence mp3 and WMV should be avoided, both because they are based on lossy compression and are proprietary. The audio format FLAC on the other hand is open and does not employ lossy compression.

I hope this is of help,
Best regards,
Thomas Niesler.

------------------------------------------------
Prof. Thomas Niesler
Digital Signal Processing Group
Department of Electronic Engineering
University of Stellenbosch
Private Bag X1, Stellenbosch 7602, South Africa
Phone: +27 21 8084118
Fax: +27 21 8084981
Email: trn@dsp.sun.ac.za
Microfiche
Software
Data Sets
- http://en.wikipedia.org/wiki/Sql
- http://en.wikipedia.org/wiki/Comparison_of_relational_database_management_systems
- http://en.wikipedia.org/wiki/Comparison_of_database_tools
Engineering drawings
See: http://www.opendesign.com
Metadata
Click on the heading above.
Language
- http://en.wikipedia.org/wiki/UTF-8
- http://en.wikipedia.org/wiki/Langauge_codes
- http://www.loc.gov/standards/iso639-2/php/code_list.php
Digitisation Guidelines
| Media type | Resolution | Bit depth | Enhancements Allowed |
|---|---|---|---|
| Printed text | 300 dpi | Bitonal | Sharpening, descreening, cropping, deskewing, despeckling |
| Rare/damaged printed text | 400 dpi | 8-bit gray or 24-bit colour | Contrast stretching; Minimal adjustments for tone and colour |
| Book illustrations | 400 - 600 dpi with enhancement | 8-bit gray or 24-bit colour; Bitonal | Contrast stretching; Minimal adjustments for tone and colour; Descreen/rescreen, sharpen |
| Manuscripts | 300 - 500 dpi with enhancement | 8-bit gray or 24-bit colour | Contrast stretching; Minimal adjustments for tone and colour |
| Maps and other oversized items | 300 - 400 dpi | 8-bit gray or 24-bit colour | Contrast stretching; Minimal adjustments for tone and colour |
| Graphic Art | 400 - 600 dpi | 8-bit/ channel internal reduction | Contrast stretching; Minimal adjustments for tone and colour |
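The resolution column of the table can be turned into a simple lookup for checking scanner settings before a batch is run. The sketch below is only an illustration: the dictionary and function names are made up, and a row with a single figure (for example 300 dpi for printed text) is assumed to mean exactly that resolution, which is an interpretation rather than something the guidelines state.

```python
# Resolution ranges in dpi, copied from the guideline table.
# A single figure is stored as an identical lower and upper bound (assumption).
GUIDELINE_DPI = {
    "Printed text": (300, 300),
    "Rare/damaged printed text": (400, 400),
    "Book illustrations": (400, 600),
    "Manuscripts": (300, 500),
    "Maps and other oversized items": (300, 400),
    "Graphic Art": (400, 600),
}


def resolution_within_guideline(media_type: str, dpi: int) -> bool:
    """True when dpi falls inside the guideline range for the media type."""
    low, high = GUIDELINE_DPI[media_type]  # KeyError for an unknown media type
    return low <= dpi <= high
```

For example, `resolution_within_guideline("Manuscripts", 400)` is true, while `resolution_within_guideline("Graphic Art", 300)` is false because 300 dpi is below the 400 dpi lower bound.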
- Please note
- All archival material is to be digitised in TIFF format
- The TIFF copy, together with the derived PNG or any additional copies, is to be submitted to SUNScholar
- Document provenance metadata:
- dc.description.provenance e.g. Original scanned in at 600 dpi, 100% DigiBook 10000 RGB colour, downsized to 840 pixels in width, resolution 250. Web version done automatically by PhotoShop 7 software. Downloading time approx. 26 seconds. Date done March - April 2007.
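Provenance notes like the example above are easier to keep consistent when they are assembled from recorded scan settings rather than typed by hand. The following is a minimal sketch of that idea. The class and field names are hypothetical, and it reproduces only part of the example wording, leaving out the web-version and download-time sentences.

```python
from dataclasses import dataclass


@dataclass
class ScanProvenance:
    """Scan settings to be recorded in dc.description.provenance (illustrative)."""
    scan_dpi: int
    scanner: str
    colour_mode: str
    derivative_width_px: int
    derivative_dpi: int
    date_done: str

    def to_provenance(self) -> str:
        # Follows the sentence pattern of the example provenance note.
        return (
            f"Original scanned in at {self.scan_dpi} dpi, {self.scanner} "
            f"{self.colour_mode}, downsized to {self.derivative_width_px} pixels "
            f"in width, resolution {self.derivative_dpi}. "
            f"Date done {self.date_done}."
        )


note = ScanProvenance(600, "DigiBook 10000", "RGB colour", 840, 250,
                      "March - April 2007").to_provenance()
```

The resulting `note` string would then be entered as the value of the `dc.description.provenance` field for the item.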