A Generic Research Data Infrastructure for Long Tail Research Data Management

Authors : Atif Latif, Fidan Limani, Klaus Tochtermann

The advent of data intensive science has fueled the generation of digital scientific data. Undoubtedly, digital research data plays a pivotal role in transparency and re-producibility of scientific results as well as in steering the innovation in a research process.

However, the main challenges for science policy and infrastructure projects are to develop practices and solutions for research data management which in compliance with good scientific standards make the research data discoverable, citeble and accessible for society potential reuse.

GeRDI – the Generic Research Data (RD) Infrastructure – is such a research data management initiative which targets long tail content that stems from research communities belonging to different domain and research practices.

It provides a generic and open software which connects research data infrastructures of communities to enable the investigation of multidisciplinary research questions.

URL : A Generic Research Data Infrastructure for Long Tail Research Data Management

DOI : http://doi.org/10.5334/dsj-2019-017

Are Research Datasets FAIR in the Long Run?

Authors : Dennis Wehrle, Klaus Rechert

Currently, initiatives in Germany are developing infrastructure to accept and preserve dissertation data together with the dissertation texts (on state level – bwDATA Diss, on federal level – eDissPlus).

In contrast to specialized data repositories, these services will accept data from all kind of research disciplines. To ensure FAIR data principles (Wilkinson et al., 2016), preservation plans are required, because ensuring accessibility, interoperability and re-usability even for a minimum ten year data redemption period can become a major challenge.

Both for longevity and re-usability, file formats matter. In order to ensure access to data, the data’s encoding, i.e. their technical and structural representation in form of file formats, needs to be understood. Hence, due to a fast technical lifecycle, interoperability, re-use and in some cases even accessibility depends on the data’s format and our future ability to parse or render these.

This leads to several practical questions regarding quality assurance, potential access options and necessary future preservation steps. In this paper, we analyze datasets from public repositories and apply a file format based long-term preservation risk model to support workflows and services for non-domain specific data repositories.

URL : Are Research Datasets FAIR in the Long Run?

DOI : https://doi.org/10.2218/ijdc.v13i1.659

Remediation Data Management Plans : A Tool for Recovering Research Data from Messy, Messy Projects

Author : Clara Llebot

Data Management Plans (DMPs) have been used in the last decade to encourage good data management practices among researchers. DMPs are widely used, preventive tools that encourage good data management practices. DMPs are traditionally used to manage data during the planning stage of the project, often required for grant proposals, and prior to data collection.

In this paper we will use a case study to argue that Data Management Plans can be useful in improving the management of the data of research projects that have moved beyond the planning stage of the research life cycle.

In particular, we focus on the case of active projects where data has already been collected and is still being analyzed.

We discuss the differences and commonalities in structure between preventive Data Management Plans and remedial Data Management Plans, and describe in detail the additional considerations that are needed when writing remedial Data Management Plans: the goals and audience of the document, the data inventory, and an implementation plan.

URL : Remediation Data Management Plans : A Tool for Recovering Research Data from Messy, Messy Projects

DOI : https://doi.org/10.2218/ijdc.v13i1.667

Construction(s) et contradictions des données de recherche en SHS

Auteurs/Authors : Marie-Laure Malingre, Morgane Mignon, Cécile Pierre, Alexandre Serres

La structuration et le partage des données s’imposent depuis cinq ans au monde de la recherche, à travers des injonctions politiques (de Horizon 2020 au Plan national pour la science ouverte).

L’analyse de l’enquête menée en 2017 auprès des chercheurs de l’université Rennes 2 sur leurs pratiques, représentations et attentes en matière de données conduit à interroger le terme lui-même. Variable et complexe, contrairement à ce que suggère le mot « donnée », la notion ne va pas de soi.

L’article s’efforcera de montrer qu’elle fait l’objet d’une triple construction, épistémologique, intellectuelle et politique, dans les discours des chercheurs et des acteurs institutionnels, en tension avec les pratiques constatées sur le terrain.

DOI : https://www.openscience.fr/Construction-s-et-contradictions-des-donnees-de-recherche-en-SHS#

Implementing publisher policies that inform, support and encourage authors to share data: two case studies

Authors: Leila Jones, Rebecca Grant, Iain Hrynaszkiewicz

Open research data is one of the key areas in the expanding open scholarship movement. Scholarly journals and publishers find themselves at the heart of the shift towards openness, with recent years seeing an increase in the number of scholarly journals with data-sharing policies aiming to increase transparency and reproducibility of research.

In this article we present two case studies which examine the experiences that two leading academic publishers, Taylor & Francis and Springer Nature, have had in rolling out data-sharing policies.

We illustrate some of the considerations involved in providing consistent policies across journals of many disciplines, reflecting on successes and challenges.

URL : Implementing publisher policies that inform, support and encourage authors to share data: two case studies

DOI : http://doi.org/10.1629/uksg.463

Ten principles for machine-actionable data management plans

Authors : Tomasz Miksa, Stephanie Simms, Daniel Mietchen, Sarah Jones

Data management plans (DMPs) are documents accompanying research proposals and project outputs. DMPs are created as free-form text and describe the data and tools employed in scientific investigations. They are often seen as an administrative exercise and not as an integral part of research practice.

There is now widespread recognition that the DMP can have more thematic, machine-actionable richness with added value for all stakeholders: researchers, funders, repository managers, research administrators, data librarians, and others.

The research community is moving toward a shared goal of making DMPs machine-actionable to improve the experience for all involved by exchanging information across research tools and systems and embedding DMPs in existing workflows.

This will enable parts of the DMP to be automatically generated and shared, thus reducing administrative burdens and improving the quality of information within a DMP.

This paper presents 10 principles to put machine-actionable DMPs (maDMPs) into practice and realize their benefits. The principles contain specific actions that various stakeholders are already undertaking or should undertake in order to work together across research communities to achieve the larger aims of the principles themselves.

We describe existing initiatives to highlight how much progress has already been made toward achieving the goals of maDMPs as well as a call to action for those who wish to get involved.

URL : Ten principles for machine-actionable data management plans

DOI : https://doi.org/10.1371/journal.pcbi.1006750