The Future of Data in Research Publishing: From Nice to Have to Need to Have?

Authors : Christine L. Borgman, Amy Brand

Science policy promotes open access to research data for purposes of transparency and reuse of data in the public interest. We expect demands for open data in scholarly publishing to accelerate, at least partly in response to the opacity of artificial intelligence algorithms.

Open data should be findable, accessible, interoperable, and reusable (FAIR), and also trustworthy and verifiable. The current state of open data in scholarly publishing is in transition from ‘nice to have’ to ‘need to have.’

Research data are valuable, interpretable, and verifiable only in the context of their origin, and with sufficient infrastructure to facilitate reuse. Making research data useful is expensive; benefits and costs are distributed unevenly.

Open data also poses risks for provenance, intellectual property, misuse, and misappropriation in an era of trolls and hallucinating AI algorithms. Scholars and scholarly publishers must make evidentiary data more widely available to promote public trust in research.

To make research processes more trustworthy, transparent, and verifiable, stakeholders need to make greater investments in data stewardship and knowledge infrastructures.

DOI : https://doi.org/10.1162/99608f92.b73aae77

More than data repositories: perceived information needs for the development of social sciences and humanities research infrastructures

Authors : Anna Sendra, Elina Late, Sanna Kumpulainen

Introduction

The digitalization of social sciences and humanities research necessitates research infrastructures. However, this transformation is still incipient, highlighting the need to better understand how to successfully support data-intensive research.

Method

Starting from a case study of building a national infrastructure for conducting data-intensive research, this study aims to understand the information needs of digital researchers regarding the facility and explore the importance of evaluation in its development.

Analysis

Thirteen semi-structured interviews with social sciences and humanities scholars and computer and data scientists processed through a thematic analysis revealed three themes (developing a research infrastructure, needs and expectations of the research infrastructure, and an approach to user feedback and user interactions).

Results

Findings reveal that developing an infrastructure for conducting data-intensive research is a complicated task, shaped by the contrasting information needs of social sciences and humanities scholars on one side and computer and data scientists on the other, such as the former group's demand for increased support. Findings also highlight the limited role of evaluation in the infrastructure's creation.

Conclusions

The development of infrastructures for conducting data-intensive research requires further discussion that particularly considers the disciplinary differences between social sciences and humanities scholars and computer and data scientists. Suggestions on how to better design this kind of facility are also raised.

DOI : https://doi.org/10.47989/ir284598

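Enquête quantitative sur les pratiques et les besoins des chercheurs sur la gestion des données de la recherche, algorithmes et codes sources dans les établissements du site toulousain [Quantitative survey of researchers' practices and needs regarding the management of research data, algorithms, and source code at institutions of the Toulouse site]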

Authors : Danielle Brunet, Soraya Demay, Pierre Diaz, Borbala Goncz, Laure Leclerc, Flora Poupinot, Sibilla Michelle

The Comité de réflexion pour le partage et la valorisation des données de la recherche et la coordination de la Science Ouverte (CéSO) of the Université de Toulouse conducted a quantitative survey on the management of research data, algorithms, and source code.

Addressed to the entire scientific community of the Toulouse site, the survey aimed to take stock of researchers' practices, knowledge, and needs regarding research data management. The results will help refine the range of services offered on the Toulouse site.

The survey covers the member institutions of the Université de Toulouse as well as partner research organizations: Université Toulouse Capitole, Université Toulouse – Jean Jaurès, Université Toulouse III – Paul Sabatier, Institut national polytechnique de Toulouse (Toulouse INP), Institut national des sciences appliquées de Toulouse (INSA Toulouse), Institut supérieur de l’aéronautique et de l’espace (ISAE-SUPAERO), Institut national universitaire Champollion (INU Champollion), École nationale de l’aviation civile (ENAC), École nationale d’ingénieurs de Tarbes (ENIT), École nationale supérieure d’architecture de Toulouse (ENSA Toulouse), École nationale vétérinaire de Toulouse (ENVT), École nationale supérieure de formation de l’enseignement agricole (ENSFEA), Institut catholique d’arts et métiers (ICAM), École nationale supérieure des mines d’Albi-Carmaux (IMT Mines d’Albi), Toulouse Business School (TBS), Centre national d’études spatiales (CNES), Centre national de la recherche scientifique (CNRS), Institut national de recherche pour l’agriculture, l’alimentation et l’environnement (INRAE), Institut national de la santé et de la recherche médicale (Inserm), Institut de recherche pour le développement (IRD), Office national d’études et de recherche aérospatiales (Onera), Météo-France.

Original location : https://ut3-toulouseinp.hal.science/hal-04262708v1/

The Effects of Research Data Management Services: Associating the Data Curation Lifecycle with Open Research Output

Authors : Nicolas Pares, Peter Organisciak

This study seeks to understand the relationship between research data management (RDM) services framed in the data curation life cycle and the production of open data. An electronic questionnaire was distributed to US researchers and RDM specialists, and the results were analyzed using chi-square tests for association.
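As a rough illustration of what a chi-square test for association computes, the sketch below derives the Pearson statistic from a small contingency table. The counts, the row and column meanings, and the function name `chi_square_association` are hypothetical and are not taken from the study; the abstract does not say which software or variant of the test the authors used.

```python
import math


def chi_square_association(table):
    """Return the Pearson chi-square statistic and degrees of freedom
    for an r x c contingency table of observed counts."""
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    total = sum(row_totals)

    statistic = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            # Expected count if the row and column variables were independent.
            expected = row_totals[i] * col_totals[j] / total
            statistic += (observed - expected) ** 2 / expected

    dof = (len(table) - 1) * (len(table[0]) - 1)
    return statistic, dof


# Hypothetical counts (NOT from the study):
# rows    = respondent used an RDM service (yes, no)
# columns = respondent produced open data (yes, no)
observed = [[30, 10],
            [20, 40]]

statistic, dof = chi_square_association(observed)

# For one degree of freedom only, the p-value has a closed form via erfc.
# No continuity correction is applied in this sketch.
p_value = math.erfc(math.sqrt(statistic / 2))

print(f"chi2 = {statistic:.3f}, dof = {dof}, p = {p_value:.5f}")
```

A small p-value suggests that the two variables are associated in the sample; it does not measure how strong the association is.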

The data curation life cycle does associate with the production of open data and shareable research, but tasks like data management plans have stronger associations with the production of open data. The findings analyze the intersection of these concepts and provide insight into RDM services that facilitate the production of open data and shareable research.

DOI : https://doi.org/10.5860/crl.84.5.751

It Takes a Researcher to Know a Researcher: Academic Librarian Perspectives Regarding Skills and Training for Research Data Support in Canada

Author : Alisa B. Rod

Objective

This empirical study aims to contribute qualitative evidence on the perspectives of data-related librarians regarding the necessary skills, education, and training for these roles in the context of Canadian academic libraries.

A second aim of this study is to understand the perspectives of data-related librarians regarding the specific role of the MLIS in providing relevant training and education. The definition of a data-related librarian in this study includes any librarian or professional who has a conventional title related to a field of data librarianship (i.e., research data management, data services, GIS, data visualization, data science) or any other librarian or professional whose duties include providing data-related services within an academic institution.

Methods

This study incorporates in-depth qualitative empirical evidence in the form of 12 semi-structured interviews of data-related librarians to investigate first-hand perspectives on the necessary skills required for such positions and the mechanisms for acquiring and maintaining such skills.

Results

The interviews identified four major themes related to the skills required for library-related data services positions, including the perceived importance of experience conducting original research, proficiency in computational coding and quantitative methods, MLIS-related skills such as understanding metadata, and the ability to learn new skills quickly on the job.

Overall, the implication of this study for MLIS training in data-related librarianship is that, although expertise in metadata, documentation, and information management is vital for data-related librarians, the MLIS is increasingly less competitive compared with degree programs that offer a greater emphasis on practical experience working with different types of data in a research context and implementing a variety of methodological approaches.

Conclusion

This study demonstrates that an in-depth qualitative portrait of data-related librarians within a national academic ecosystem provides valuable new insights regarding the perceived importance of conducting original empirical research to succeed in these roles.

DOI : https://doi.org/10.18438/eblip30297

Open science in Sámi research: Researchers’ dilemmas

Author : Coppélie Cocq

This article discusses the challenges of Indigenous research in relation to open science, more particularly in relation to Sámi research in Sweden. Based on interviews with active scholars in the multidisciplinary field of Sámi studies, and on policy documents by Sámi organizations, this article points to the challenges that can be identified, and the practices and strategies adopted or suggested by researchers.

Topics addressed include ownership, control, sensitivity and accessibility of data, the consequences of experienced limitations, the role of the historical context, and community-groundedness.

This article aims to contribute a discussion of the tensions between standards of data management/open science and data sovereignty in Indigenous contexts. It does so by bringing in perspectives from Indigenous methodologies (the 4 Rs) and by contextualizing research practices and forms of data colonialism in relation to our contemporary context of surveillance culture.

Research, in relation to ethics and social sustainability, is an arena where tensions between various agendas become obvious. This article illustrates these tensions through researchers’ dilemmas when working with open science and the advancement of Indigenous research.

Efforts toward ethically valid and culturally sensitive modes of data use are taking shape in Indigenous research, calling for an increased awareness about the topic. In the context of Sámi research, the role of academia in such a transformation is also essential.

DOI : https://doi.org/10.3389/frma.2023.1095169

Evolution of research data management in academic libraries: A review of the literature

Authors : Arslan Sheikh, Amara Malik, Rubina Adnan

This study provides insights into the evolution and conceptual framework of research data management (RDM). It also investigates the role of libraries and librarians in offering data management services and the challenges they face in this regard.

The study is qualitative in nature and based on an extensive literature review. The analysis of the reviewed literature reveals that RDM has emerged as a new addition to library research support services.

The more recent literature clearly established the pivotal role of libraries and librarians in developing and managing RDM services. However, data sharing practices and the development of RDM services in libraries are more prevalent in developed countries.

These trends are still lacking among researchers and libraries in developing countries. Creating awareness among researchers about the benefits of data sharing is a challenging task for libraries.

Furthermore, the literature highlights other significant challenges: institutional commitment, collaboration, academic engagement, technological infrastructure development, the lack of policies, funding, and storage, and the skills and competencies librarians need to offer RDM-based services.

RDM services are difficult and complicated; librarians therefore need to master research data skills to offer library-based RDM services.

DOI : https://doi.org/10.1177/02666669231157405