Investigation and Development of the Workflow to Clarify Conditions of Use for Research Data Publishing in Japan

Authors : Yasuyuki Minamiyama, Ui Ikeuchi, Kunihiko Ueshima, Nobuya Okayama, Hideaki Takeda

With the recent Open Science movement and the rise of data-intensive science, many efforts are in progress to publish research data on the web. To reuse published research data in different fields, they must be made more generalized, interoperable, and machine-readable.

Among the various issues related to data publishing, the conditions of use are directly related to their reuse potential. We show herein the types of external constraints and conditions of use in research data publishing in a Japanese context through the analysis of the interview and questionnaire for practitioners.

Although the conditions of research data use have been discussed only in terms of their legal constraints, we organize the inclusion of the non-legal constraints and data holders’ actual requirements.

Furthermore, we develop practical guideline for examining effective data publishing flow with licensing scenarios. This effort can be positioned to develop an infrastructure for data-intensive science, which will contribute to the realization of Open Science.

URL : Investigation and Development of the Workflow to Clarify Conditions of Use for Research Data Publishing in Japan

DOI : http://doi.org/10.5334/dsj-2020-053

Data journals: incentivizing data access and documentation within the scholarly communication system

Author : William H. Walters

Data journals provide strong incentives for data creators to verify, document and disseminate their data. They also bring data access and documentation into the mainstream of scholarly communication, rewarding data creators through existing mechanisms of peer-reviewed publication and citation tracking.

These same advantages are not generally associated with data repositories, or with conventional journals’ data-sharing mandates. This article describes the unique advantages of data journals.

It also examines the data journal landscape, presenting the characteristics of 13 data journals in the fields of biology, environmental science, chemistry, medicine and health sciences.

These journals vary considerably in size, scope, publisher characteristics, length of data reports, data hosting policies, time from submission to first decision, article processing charges, bibliographic index coverage and citation impact.

They are similar, however, in their peer review criteria, their open access license terms and the characteristics of their editorial boards.

URL : Data journals: incentivizing data access and documentation within the scholarly communication system

DOI : http://doi.org/10.1629/uksg.510

Open science-based framework to reveal open data publishing: an experience from using Common Crawl

Authors : Andreiwid Correa, Israel Fernandes

The publishing of open data is considered a key element for civic participation paving the way to the ‘public value’, a term which underpins the social contribution. A result of that can be seen through the popularity of data portals published all around the world by governments, public and private organizations.

However, the diffusion of data portals raises concerns about discoverability and validity of these data sources, especially to what extent they contribute to open data and open science.

The purpose of this work is to develop a framework to reveal open data publishing with the use of a freely available open science project called Common Crawl. The idea is to identify open data-related initiatives and to gather information about their availability, having in the framework’s essence an iterative and differential process.

The main outcome is shown through a proposed model for the historical data repository which involves both use and creation of open science to branch new sort of research possibilities based on publishing of derived data.

URL : https://hal.archives-ouvertes.fr/hal-02544245

Disciplinary data publication guides

Authors : Zosia Beckles, Stephen Gray, Debra Hiom, Kirsty Merrett, Kellie Snow, Damian Steer

Many academic disciplines have very comprehensive standard for data publication and clear guidance from funding bodies and academic publishers. In other cases, whilst much good-quality general guidance exists, there is a lack of information available to researchers to help them decide which specific data elements should be shared.

This is a particular issue for disciplines with very varied data types, such as engineering, and presents an unnecessary barrier to researchers wishing to meet funder expectations on data sharing.

This article outlines a project to provide simple, visual, discipline-specific guidance on data publication, undertaken at the University of Bristol at the request of the Faculty of Engineering.

URL : Disciplinary data publication guides

DOI : https://doi.org/10.2218/ijdc.v13i1.603

Nanopublications: A Growing Resource of Provenance-Centric Scientific Linked Data

Authors : Tobias Kuhn, Albert Meroño-Peñuela, Alexander Malic, Jorrit H. Poelen, Allen H. Hurlbert, Emilio Centeno Ortiz, Laura I. Furlong, Núria Queralt-Rosinach, Christine Chichester, Juan M. Banda, Egon Willighagen, Friederike Ehrhart, Chris Evelo, Tareq B. Malas, Michel Dumontier

Nanopublications are a Linked Data format for scholarly data publishing that has received considerable uptake in the last few years. In contrast to the common Linked Data publishing practice, nanopublications work at the granular level of atomic information snippets and provide a consistent container format to attach provenance and metadata at this atomic level.

While the nanopublications format is domain-independent, the datasets that have become available in this format are mostly from Life Science domains, including data about diseases, genes, proteins, drugs, biological pathways, and biotic interactions.

More than 10 million such nanopublications have been published, which now form a valuable resource for studies on the domain level of the given Life Science domains as well as on the more technical levels of provenance modeling and heterogeneous Linked Data.

We provide here an overview of this combined nanopublication dataset, show the results of some overarching analyses, and describe how it can be accessed and queried.

URL : https://arxiv.org/abs/1809.06532

Connecting Data Publication to the Research Workflow: A Preliminary Analysis

Authors : Sünje Dallmeier-Tiessen, Varsha Khodiyar, Fiona Murphy, Amy Nurnberger, Lisa Raymond, Angus Whyte

The data curation community has long encouraged researchers to document collected research data during active stages of the research workflow, to provide robust metadata earlier, and support research data publication and preservation.

Data documentation with robust metadata is one of a number of steps in effective data publication. Data publication is the process of making digital research objects ‘FAIR’, i.e. findable, accessible, interoperable, and reusable; attributes increasingly expected by research communities, funders and society.

Research data publishing workflows are the means to that end. Currently, however, much published research data remains inconsistently and inadequately documented by researchers.

Documentation of data closer in time to data collection would help mitigate the high cost that repositories associate with the ingest process. More effective data publication and sharing should in principle result from early interactions between researchers and their selected data repository.

This paper describes a short study undertaken by members of the Research Data Alliance (RDA) and World Data System (WDS) working group on Publishing Data Workflows. We present a collection of recent examples of data publication workflows that connect data repositories and publishing platforms with research activity ‘upstream’ of the ingest process.

We re-articulate previous recommendations of the working group, to account for the varied upstream service components and platforms that support the flow of contextual and provenance information downstream.

These workflows should be open and loosely coupled to support interoperability, including with preservation and publication environments. Our recommendations aim to stimulate further work on researchers’ views of data publishing and the extent to which available services and infrastructure facilitate the publication of FAIR data.

We also aim to stimulate further dialogue about, and definition of, the roles and responsibilities of research data services and platform providers for the ‘FAIRness’ of research data publication workflows themselves.

URL : Connecting Data Publication to the Research Workflow: A Preliminary Analysis

DOI : https://doi.org/10.2218/ijdc.v12i1.533