An analysis of the effects of sharing research data, code, and preprints on citations

Authors : Giovanni Colavizza, Lauren Cadwallader, Marcel LaFlamme, Grégory Dozot, Stéphane Lecorney, Daniel Rappo, Iain Hrynaszkiewicz

Calls to make scientific research more open have gained traction with a range of societal stakeholders. Open Science practices include but are not limited to the early sharing of results via preprints and openly sharing outputs such as data and code to make research more reproducible and extensible. Existing evidence shows that adopting Open Science practices has effects in several domains.

In this study, we investigate whether adopting one or more Open Science practices leads to significantly higher citations for an associated publication, which is one form of academic impact. We use a novel dataset known as Open Science Indicators, produced by PLOS and DataSeer, which includes all PLOS publications from 2018 to 2023 as well as a comparison group sampled from the PMC Open Access Subset. In total, we analyze circa 122’000 publications. We calculate publication and author-level citation indicators and use a broad set of control variables to isolate the effect of Open Science Indicators on received citations.

We show that Open Science practices are adopted to different degrees across scientific disciplines. We find that the early release of a publication as a preprint correlates with a significant positive citation advantage of about 20.2% on average. We also find that sharing data in an online repository correlates with a smaller yet still positive citation advantage of 4.3% on average.

However, we do not find a significant citation advantage for sharing code. Further research is needed on additional or alternative measures of impact beyond citations. Our results are likely to be of interest to researchers, as well as publishers, research funders, and policymakers.

Arxiv : https://arxiv.org/abs/2404.16171

Data services at the academic library: a natural history of horses and unicorns

Authors : Jeffrey Oliver, Fernando Rios, Kiriann Carin, Chun Ly

Objective

Increases in data-intensive research at colleges and universities is driving demand for data services provided by academic libraries. The current work investigates the distribution of library data services, how such services are offered, and the effect of resourcing on the amount of services offered by a library.

Methods

We used a web-based inventory of 25 academic libraries at U.S. Research 1 (R1) Carnegie institutions to assess the state of data services at university libraries. We categorized and quantified services, and tested for an effect of library resourcing on the size of library data service portfolios.

Results

Support for data management and geospatial services was relatively widespread, with increasing support in areas of data analyses and data visualization. There was significant variation among services in the modality in which they were offered (web, consult, instruction) and library resourcing had a significant effect on the number of data services a library offered.

Conclusions

While a core subset of these data services are offered at most academic libraries, more specialized topics are restricted to well-resourced libraries. In light of the influence of resource scarcity on the number of services a library can offer, intra- and inter-campus partnerships will be critical to ensure campus support for data service needs.

URL : Data services at the academic library: a natural history of horses and unicorns

DOI : https://doi.org/10.7191/jeslib.780

Assessing Quality Variations in Early Career Researchers’ Data Management Plans

Author : Jukka Rantasaari

This paper aims to better understand early career researchers’ (ECRs’) research data management (RDM) competencies by assessing the contents and quality of data management plans (DMPs) developed during a multi-stakeholder RDM course. We also aim to identify differences between DMPs in relation to several background variables (e.g., discipline, course track).

The Basics of Research Data Management (BRDM) course has been held in two multi-faculty, research-intensive universities in Finland since 2020. In this study, 223 ECRs’ DMPs created in the BRDM of 2020 – 2022 were assessed, using the recommendations and criteria of the Finnish DMP Evaluation Guide + General Finnish DMP Guidance (FDEG).

The median quality of DMPs appeared to be satisfactory. The differences in rating according to FDEG’s three-point performance criteria were statistically insignificant between DMPs developed in separate years, course tracks or disciplines. However, using content analysis, differences were found between disciplines or course tracks regarding DMP’s key characteristics such as sharing, storing, and preserving data.

DMPs that contained a data table (DtDMPs) also differed highly significantly from prose DMPs. DtDMPs better acknowledged the data handling needs of different data types and improved the overall quality of a DMP.

The results illustrated that the ECRs had learned the basic RDM competencies and grasped their significance to the integrity, reliability, and reusability of data. However, more focused, further training to reach the advanced competency is needed, especially in areas of handling and sharing personal data, legal issues, long-term preserving, and funders’ data policies.

Equally important to the cultural change when RDM is an organic part of the research practices is to merge research support services, processes, and infrastructure into the research projects’ processes. Additionally, incentives are needed for sharing and reusing data.

URL : Assessing Quality Variations in Early Career Researchers’ Data Management Plans

DOI : https://doi.org/10.2218/ijdc.v18i1.873

Between Flat-Earthers and Fitness Coaches: Who is Citing Scientific Publications in YouTube Video Descriptions?

Authors : Olga Zagovora, Katrin Weller

In this study, we undertake an extensive analysis of YouTube channels that reference research publications in their video descriptions, offering a unique insight into the intersection of digital media and academia. Our investigation focuses on three principal aspects: the background of YouTube channel owners, their thematic focus, and the nature of their operational dynamics, specifically addressing whether they work individually or in groups. Our results highlight a strong emphasis on content related to science and engineering, as well as health, particularly in channels managed by individual researchers and academic institutions.

However, there is a notable variation in the popularity of these channels, with professional YouTubers and commercial media entities often outperforming in terms of viewer engagement metrics like likes, comments, and views. This underscores the challenge academic channels face in attracting a wider audience. Further, we explore the role of academic actors on YouTube, scrutinizing their impact in disseminating research and the types of publications they reference.

Despite a general inclination towards professional academic topics, these channels displayed a varied effectiveness in spotlighting highly cited research. Often, they referenced a wide array of publications, indicating a diverse but not necessarily impact-focused approach to content selection.

Arxiv : https://arxiv.org/abs/2404.15083

Rapport d’Enquête Création d’une revue d’articles sur des jeux de données Data Journal SHS

Auteur.ices/Authors : Laurence Bizien, Véronique Cohoner, Fiona Edmond, Arnaud Natal, Pierre Peraldi-Mittelette

La présente enquête a été menée dans le cadre du projet de création d’une revue de données interdisciplinaire en Sciences Humaines et Sociales à l’horizon 2025. Le groupe de travail (GT) œuvrant à ce projet a vu le jour suite à la journée d’études organisée par la Maison des Sciences de l’Homme Lorraine le 10 mars 2023; intitulée : « Un data journal interdisciplinaire pour les sciences humaines et sociales. Enjeux scientifiques et mise en œuvre pratique »

URL : Rapport d’Enquête Création d’une revue d’articles sur des jeux de données Data Journal SHS

HAL : https://hal.univ-lorraine.fr/hal-04541094

The role of non-scientific factors vis-a-vis the quality of publications in determining their scholarly impact

Authors : Giovanni Abramo, Ciriaco Andrea D’Angelo, Leonardo Grilli

In the evaluation of scientific publications’ impact, the interplay between intrinsic quality and non-scientific factors remains a subject of debate. While peer review traditionally assesses quality, bibliometric techniques gauge scholarly impact. This study investigates the role of non-scientific attributes alongside quality scores from peer review in determining scholarly impact.

Leveraging data from the first Italian Research Assessment Exercise (VTR 2001-2003) and Web of Science citations, we analyse the relationship between quality scores, non-scientific factors, and publication short- and long-term impact.

Our findings shed light on the significance of non-scientific elements overlooked in peer review, offering policymakers and research management insights in choosing evaluation methodologies. Sections delve into the debate, identify non-scientific influences, detail methodologies, present results, and discuss implications.

Arxiv : https://arxiv.org/abs/2404.05345