SHS - Horizontal Scalability

Horizontal scalability specification

SHS

Horizontal Scalability Specifications

Version: 0

Author

W Hugo

Draft

17-12-2015

#

Concept

Description

Reference

SHS-01

Self-Curation

Large parts of the data curation life cycle needs to be accessible and under control of the depositor.

 

SHS-02

Crowd-Sourced Curation

It may be possible to utilise third-party contributions in a number of additional ways:

 

1. Gamifying aspects of curation: relies on voluntary contributions by third parties to describe, standardise and publish data.

2. Small Contracts can be used to improve the publication quality of the data sets under curation;

3. Improvement through Use could include additional meta-data, standardisation of schema, syntax, and semantics, and conversion to new formats and schema.

SHS-03

Data Volume

Data volumes are likely to rise significantly as automated observation and data processing snowballs within data-intensive science. It is unlikely to present a significant cost factor

 

SHS-04

Transparent Scalability

Users should not really have to care where data is stored once a decision is made to use a repository provided by a broader infrastructure. The boundary of the physical DIRISA infrastructure is fuzzy.

 

SHS-05

Management of Registered Users

Alignment with facilities such as EDUROAM, with proven scalability and a large contingent of end users already registered can be considered. A parallel option is required for end users that do not qualify as being from educational institutions. NRF RISA administration is also an option for integration – most grant-funded researchers are registered. End users that fall outside these groupings require a basic access – possibly integration with ORCID.

EDUROAM

RISA

ORCID

SHS-06

Cost Recovery

Cost recovery mechanisms should be scalable and linked to Open Access publication processes. Simplest models are based on alignment with grant-funded research.

RISA

 

Document Actions