OpenData

From Libopedia
Jump to: navigation, search
Back to Open Campus

Introduction

Science must be reproducible, therefore archiving of digital experimental data and methods is critically important.

Lately big data has become a "hot-topic". Below is information that explains what big data is and how it differs from research data.

What is research data?

Watch the following animated fictional videos for an introduction:

https://youtu.be/N2zK3sAtr-4
https://youtu.be/nNBiCcBlwRA

Rdm.png

What is big data?

Img bigdata.png

What are the major differences between research data (RD) and big data (BD)?

  1. BD datasets are huge => 1 Terabyte (TB), RD datasets are much smaller <= 1 Gigabyte (GB).
  2. BD is unstructured and uses a NOSQL database such as MongoDB or OrientDB. RD is very structured and uses a SQL database such as PostgreSQL.
  3. BD has a many sources, RD has usually one source.
  4. BD is collected in real time, RD is collected after analysis.

Basically BD tools are used to "surface" patterns from huge datasets, usually in real time, and make predictions whereas RD is used to store the results of BD analysis.

What is Open Data?

Panton Principals For Research Data

Standards

Metadata

Possible Open Research Data Archiving Implementation At Stellenbosch University

Data-schematic.png


Data Implementation Diagram Notes

More Information On Other Open Data Systems

http://wiki.lib.sun.ac.za/index.php/OpenGIS
http://wiki.lib.sun.ac.za/index.php/OpenSurvey
http://wiki.lib.sun.ac.za/index.php/OpenBiology

Research Data Management Plans (RDMP)

References

CKAN4RDM Discussion

Catalogs

South African

African

International

Infrastructure

Books

Organisations

Training

Conferences

Software

Visualisation

Rankings

Integration

Service Providers

Research

Analysis

News

Graphics