Introducing a data availability policy for journals at IOP Publishing: Measuring the impact on authors and editorial teams

Authors : Jade Holt, Andrew Walker, Phill Jones

As the open research movement continues to gather pace, a number of publishers, funders, and institutions are mandating the sharing of underlying research data. At the same time, concerns about introducing extra quality control steps around data availability statements (DAS) are driving a discussion about the best way to make data more open without slowing down publication.

This article describes a pilot project to introduce a new Open Data policy to three IOP Publishing (IOPP) journals as part of IOPP’s commitment to increasing transparency and support for open science.

An investigation was undertaken using an automated workflow monitoring tool to understand the impact of this change on authors and the editorial staff. Changes in revised submission processing times and how often manuscripts were returned to the author were measured.

An overall increase in the time editorial staff spent processing manuscripts was found as well as an increase in the number of times manuscripts were returned to authors. Detailed analysis shows that manuscripts in which authors claim in the DAS to have included data within the manuscript were the most strongly affected. Steps to mitigate the effects through improved author communication were found to be effective.

URL : Introducing a data availability policy for journals at IOP Publishing: Measuring the impact on authors and editorial teams

DOI : https://doi.org/10.1002/leap.1386

Between administration and research: Understanding data management practices in an institutional context

Authors : Stefan Reichmann, Thomas Klebel, Ilire Hasani-Mavriqi, Tony Ross-Hellauer

Research Data Management (RDM) promises to make research outputs more transparent, findable, and reproducible. Strategies to streamline data management across disciplines are of key importance.

This paper presents results of an institutional survey (N = 258) at a medium-sized Austrian university with a STEM focus, supplemented with interviews (N = 18), to give an overview of the state-of-play of RDM practices across faculties and disciplinary contexts.

RDM services are on the rise but remain somewhat behind leading countries like the Netherlands and UK, showing only the beginnings of a culture attuned to RDM. There is considerable variation between faculties and institutes with respect to data amounts, complexity of data sets, data collection and analysis, and data archiving.

Data sharing practices within fields tend to be inconsistent. RDM is predominantly regarded as an administrative task, to the detriment of considerations of good research practice. Problems with RDM fall in two categories: Generic problems transcend specific research interests, infrastructures, and departments while discipline-specific problems need a more targeted approach.

The paper extends the state-of-the-art on RDM practices by combining in-depth qualitative material with quantified, detailed data about RDM practices and needs. The findings should be of interest to any comparable research institution with a similar agenda.

URL : Between administration and research: Understanding data management practices in an institutional context

DOI : https://doi.org/10.1002/asi.24492

Research Data Management Challenges in Citizen Science Projects and Recommendations for Library Support Services. A Scoping Review and Case Study

Authors: Jitka Stilund Hansen, Signe Gadegaard, Karsten Kryger Hansen, Asger Væring Larsen, Søren Møller, Gertrud Stougård Thomsen, Katrine Flindt Holmstrand

Citizen science (CS) projects are part of a new era of data aggregation and harmonisation that facilitates interconnections between different datasets. Increasing the value and reuse of CS data has received growing attention with the appearance of the FAIR principles and systematic research data management (RDM) practises, which are often promoted by university libraries.

However, RDM initiatives in CS appear diversified and if CS have special needs in terms of RDM is unclear. Therefore, the aim of this article is firstly to identify RDM challenges for CS projects and secondly, to discuss how university libraries may support any such challenges.

A scoping review and a case study of Danish CS projects were performed to identify RDM challenges. 48 articles were selected for data extraction. Four academic project leaders were interviewed about RDM practices in their CS projects.

Challenges and recommendations identified in the review and case study are often not specific for CS. However, finding CS data, engaging specific populations, attributing volunteers and handling sensitive data including health data are some of the challenges requiring special attention by CS project managers. Scientific requirements or national practices do not always encompass the nature of CS projects.

Based on the identified challenges, it is recommended that university libraries focus their services on 1) identifying legal and ethical issues that the project managers should be aware of in their projects, 2) elaborating these issues in a Terms of Participation that also specifies data handling and sharing to the citizen scientist, and 3) motivating the project manager to good data handling practises.

Adhering to the FAIR principles and good RDM practices in CS projects will continuously secure contextualisation and data quality. High data quality increases the value and reuse of the data and, therefore, the empowerment of the citizen scientists.

URL : Research Data Management Challenges in Citizen Science Projects and Recommendations for Library Support Services. A Scoping Review and Case Study

DOI : http://doi.org/10.5334/dsj-2021-025

Doctoral Students’ Educational Needs in Research Data Management: Perceived Importance and Current Competencies

Author : Jukka Rantasaari

Sound research data management (RDM) competencies are elementary tools used by researchers to ensure integrated, reliable, and re-usable data, and to produce high quality research results.

In this study, 35 doctoral students and faculty members were asked to self-rate or rate doctoral students’ current RDM competencies and rate the importance of these competencies.

Structured interviews were conducted, using close-ended and open-ended questions, covering research data lifecycle phases such as collection, storing, organization, documentation, processing, analysis, preservation, and data sharing.

The quantitative analysis of the respondents’ answers indicated a wide gap between doctoral students’ rated/self-rated current competencies and the rated importance of these competencies.

In conclusion, two major educational needs were identified in the qualitative analysis of the interviews: to improve and standardize data management planning, including awareness of the intellectual property and agreements issues affecting data processing and sharing; and to improve and standardize data documenting and describing, not only for the researcher themself but especially for data preservation, sharing, and re-using. Hence the study informs the development of RDM education for doctoral students.

URL : Doctoral Students’ Educational Needs in Research Data Management: Perceived Importance and Current Competencies

DOI : https://doi.org/10.2218/ijdc.v16i1.684

Manuscript Accepted!: Collaborating on a scholarly publishing symposium for graduate students and early career academic faculty

Authors : Teresa Auch Schultz, Rosalind Bucy, Amy Hunsaker, Amy Shannon, Chrissy Klenke, Iñaki Arrieta Baro

INTRODUCTION

As academic libraries expand their scholarly communication support, they also need to find ways to help educate graduate students about this area as well as market themselves.

DESCRIPTION OF PROGRAM

The University of Nevada, Reno Libraries created a one-day symposium, called Manuscript Accepted!, aimed at graduate students and early career faculty that would use faculty and library expertise to lead panels and workshops.

This article discusses planning for the event, including collaborating with other on-campus groups, working with publishers for financial support, and planning a program that would meet a variety of needs. Assessment of the first two symposiums, held in 2019 and 2020, shows that attendees valued the event while also highlighting the need for more targeted support for specific areas, such as the humanities.

NEXT STEPS

The Libraries plans to continue Manuscript Accepted! as a one-day symposium, although it will also look to ways to expand attendance. Finally, the Libraries is investigating ways to create smaller events that could be tied into the Manuscript Acceptance! brand but that help meet other needs of our attendees.

URL : Manuscript Accepted!: Collaborating on a scholarly publishing symposium for graduate students and early career academic faculty

DOI : https://doi.org/10.7710/2162-3309.2385

Institutional Data Repository Development, a Moving Target

Authors : Colleen Fallaw, Genevieve Schmitt, Hoa Luong, Jason Colwell, Jason Strutz

At the end of 2019, the Research Data Service (RDS) at the University of Illinois at Urbana-Champaign (UIUC) completed its fifth year as a campus-wide service. In order to gauge the effectiveness of the RDS in meeting the needs of Illinois researchers, RDS staff developed a five-year review consisting of a survey and a series of in-depth focus group interviews.

As a result, our institutional data repository developed in-house by University Library IT staff, Illinois Data Bank, was recognized as the most useful service offering by our unit. When launched in 2016, storage resources and web servers for Illinois Data Bank and supporting systems were hosted on-premises at UIUC.

As anticipated, researchers increasingly need to share large, and complex datasets. In a responsive effort to leverage the potentially more reliable, highly available, cost-effective, and scalable storage accessible to computation resources, we migrated our item bitstreams and web services to the cloud. Our efforts have met with success, but also with painful bumps along the way.

This article describes how we supported data curation workflows through transitioning from on-premises to cloud resource hosting. It details our approaches to ingesting, curating, and offering access to dataset files up to 2TB in size–which may be archive type files (e.g., .zip or .tar) containing complex directory structures.

URL : https://journal.code4lib.org/articles/15821

Transparency, provenance and collections as data: the National Library of Scotland’s Data Foundry

Author : Sarah Ames

‘Collections as data’ has become a core activity for libraries in recent years: it is important that we make collections available in machine-readable formats to enable and encourage computational research. However, while this is a necessary output, discussion around the processes and workflows required to turn collections into data, and to make collections data available openly, are just as valuable.

With libraries increasingly becoming producers of their own collections – presenting data from digitisation and digital production tools as part of datasets, for example – and making collections available at scale through mass-digitisation programmes, the trustworthiness of our processes comes into question.

In a world of big data, often of unclear origins, how can libraries be transparent about the ways in which collections are turned into data, how do we ensure that biases in our collections are recognised and not amplified, and how do we make these datasets available openly for reuse?

This paper presents a case study of work underway at the National Library of Scotland to present collections as data in an open and transparent way – from establishing a new Digital Scholarship Service, to workflows and online presentation of datasets.

It considers the changes to existing processes needed to produce the Data Foundry, the National Library of Scotland’s open data delivery platform, and explores the practical challenges of presenting collections as data online in an open, transparent and coherent manner.

URL : Transparency, provenance and collections as data: the National Library of Scotland’s Data Foundry

Original location : https://www.liberquarterly.eu/article/10.18352/lq.10371/