Show simple item record

dc.contributor.authorKlump, Jens
dc.contributor.authorWyborn, Lesley
dc.contributor.authorWu, Mingfang
dc.contributor.authorDowns, Robert
dc.contributor.authorAsmi, Ari
dc.contributor.authorRyder, Gerry
dc.contributor.authorMartin, Julia
dc.date.accessioned2023-03-16T20:37:09Z
dc.date.available2023-03-16T20:37:09Z
dc.date.issued2020
dc.identifier.citationKlump, J., Wyborn, L., Downs, R., Asmi, A., Wu, M., Ryder, G., & Martin, J. (2020) Principles and best practices in data versioning for all data sets big and small. Version 1.1. Research Data Alliance, 19pp. DOI: 10.15497/RDA00042.en_US
dc.identifier.urihttps://repository.oceanbestpractices.org/handle/11329/2157
dc.description.abstractThe demand for better reproducibility of research results is growing. More and more data is becoming available online. In some cases, the datasets have become so large that downloading the data is no longer feasible. Data can also be offered through web services and accessed on demand. This means that parts of the data are accessed at a remote source when needed. In this scenario, it will become increasingly important for a researcher to be able to cite the exact extract of the data set that was used to underpin their research publication. However, while the means to identify datasets using persistent identifiers have been in place for more than a decade, systematic data versioning practices are currently not available. Versioning procedures and best practices are well established for scientific software. The related Wikipedia article gives an overview of software versioning practices. The codebase of large software projects does bear some semblance to large dynamic datasets. Are therefore versioning practices for code also suitable for data sets or do we need a separate suite of practices for data versioning? How can we apply our knowledge of versioning code to improve data versioning practices? This Working Group investigated to which extent these practices can be used to enhance the reproducibility of scientific results. The Research Data Alliance (RDA) Data Versioning Working Group produced this white paper to document use cases and practices, and to make recommendations for the versioning of research data. To further adoption of the outcomes, the Working Group contributed selected use cases and recommended data versioning practices to other groups in RDA and W3C. The outcomes of the RDA Data Versioning Working Group add a central element to the systematic management of research data at any scale by providing recommendations for standard practices in the versioning of research data. These practice guidelines are illustrated by a collection of use cases.en_US
dc.language.isoenen_US
dc.publisherResearch Data Alliance (RDA)en_US
dc.subject.otherData versioningen_US
dc.titlePrinciples and best practices in data versioning for all data sets big and small. Version 1.1.en_US
dc.typeReporten_US
dc.description.statusPublisheden_US
dc.format.pages19pp.en_US
dc.contributor.corpauthorResearch Data Alliance Data Versioning Working Groupen_US
dc.description.refereedRefereeden_US
dc.identifier.doi10.15497/RDA00042
dc.subject.parameterDisciplineCross-disciplineen_US
dc.subject.dmProcessesData archival/stewardship/curationen_US
dc.subject.dmProcessesData processingen_US
dc.description.currentstatusCurrenten_US
dc.description.maturitylevelPilot or Demonstrateden_US
dc.description.adoptionValidated (tested by third parties)en_US
dc.description.adoptionMulti-organisationalen_US
dc.description.methodologyTypeSpecification of criteriaen_US
obps.contact.contactemailenquiries@rd-alliance.org
obps.resourceurl.publisherhttps://www.rd-alliance.org/group/data-versioning-wg/outcomes/principles-and-best-practices-data-versioning-all-data-sets-big


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record