The role of online search platforms in scientific diffusion

Author : Kyriakos Drivas

After the launch of Google Scholar older papers experienced an increase in their citations, a finding consistent with a reduction in search costs and introduction of ranking algorithms. I employ this observation to examine how recombination of science takes place in the era of online search platforms.

The findings show that as papers become more discoverable, their knowledge is diffused beyond their own broad field. Results are mixed when examining knowledge diffusion within the same field.

The results contribute to the ongoing debate of narrowing of science. While there might a general reduction in recombination of knowledge across distant fields over the last decades, online search platforms are not the culprits.

URL : The role of online search platforms in scientific diffusion

DOI : https://doi.org/10.1002/asi.24959

When researchers pay to publish: Results from a survey on APCs in four countries

Authors : Osvaldo Gallardo, Matías Milia, André Luiz Appel, Grip-APC Team, François van Schalkwyk

This paper provides an empirical overview of the impact and practices of paying Article Processing Charges (APCs) by four nationally categorized groups of researchers in Argentina, Brazil, Mexico, and South Africa.

The data was collected from 13,577 researchers through an online questionnaire. The analysis compares the practice of publishing in journals that charge APCs across different dimensions, including country, discipline, gender, and age of the researchers.

The paper also focuses on the maximum amount APC paid and the methods and strategies researchers use to cover APC payments, such as waivers, research project funds, payment by coauthors, and the option to publish in closed access, where possible. Different tendencies were identified among the different disciplines and the national systems examined.

Findings show that Argentine researchers apply for waivers most frequently and often use personal funds or international coauthors for APCs, with younger researchers less involved in APC payments. In contrast, Brazil, South Africa, and Mexico have more older researchers, yet younger researchers still publish more in APC journals. South African researchers lead in APC publications, likely due to better funding access and read and publish agreements.

This study lays the groundwork for further analysis of gender asymmetries, funding access, and views on the commercial Open Access model of scientific dissemination.

Arxiv : https://arxiv.org/abs/2410.12144

The Living Library: a process-based tool for open literature review, probing the boundaries of open science

Authors : Elisabeth Angerer, Maura Cassidy Burke, Simon Dirks, Arthur Bakker, Aitana Bilinski Torres, Toine Pieters

The Living Library is a novel tool for opening the scientific process of literature reviewing. We here present its core features, set-up and workflow, and provide the open-source code via GitHub (https://github.com/Simon-Dirks/living-library). The Living Library allows researchers to sort articles thematically and temporally, has a built-in open logbook, and uses a responsive methodology.

These core features render the Living Library both a practical tool, and an educative framework for reflection on the research process. Its use deepened our understanding of what it means and what it takes to open science, which we summarise in three main lessons: openness is multidirectional, involving sharing and receiving; openness is relational and as such requires boundary work; and openness entails judgments of relevance.

This highlights the intimate connection between research relevance and open science: Opening science is no categorical practice, but the continuous syncing to a world in motion—opening up for it and to it, to varying degrees at different boundaries, in response to what is happening and what matters.

The Living Library models what such syncing can look like in relation to the evolving academic conversation. We encourage further experimentation with the Living Library to probe the boundaries of open science.

URL : The Living Library: a process-based tool for open literature review, probing the boundaries of open science

DOI : https://doi.org/10.1007/s43545-024-00964-z

Enabling preprint discovery, evaluation, and analysis with Europe PMC

Authors : Mariia Levchenko, Michael Parkin, Johanna McEntyre, Melissa Harrison

Preprints provide an indispensable tool for rapid and open communication of early research findings. Preprints can also be revised and improved based on scientific commentary uncoupled from journal-organised peer review. The uptake of preprints in the life sciences has increased significantly in recent years, especially during the COVID-19 pandemic, when immediate access to research findings became crucial to address the global health emergency.

With ongoing expansion of new preprint servers, improving discoverability of preprints is a necessary step to facilitate wider sharing of the science reported in preprints. To address the challenges of preprint visibility and reuse, Europe PMC, an open database of life science literature, began indexing preprint abstracts and metadata from several platforms in July 2018. Since then, Europe PMC has continued to increase coverage through addition of new servers, and expanded its preprint initiative to include the full text of preprints related to COVID-19 in July 2020 and then the full text of preprints supported by the Europe PMC funder consortium in April 2022.

The preprint collection can be searched via the website and programmatically, with abstracts and the open access full text of COVID-19 and Europe PMC funder preprint subsets available for bulk download in a standard machine-readable JATS XML format. This enables automated information extraction for large-scale analyses of the preprint corpus, accelerating scientific research of the preprint literature itself.

This publication describes steps taken to build trust, improve discoverability, and support reuse of life science preprints in Europe PMC. Here we discuss the benefits of indexing preprints alongside peer-reviewed publications, and challenges associated with this process.

URL : Enabling preprint discovery, evaluation, and analysis with Europe PMC

DOI : https://doi.org/10.1101/2024.04.19.590240

Re-use of research data in the social sciences. Use and users of digital data archive

Authors : Elina LateI, Michael Ochsner

The aim of this paper is to investigate the re-use of research data deposited in digital data archive in the social sciences. The study examines the quantity, type, and purpose of data downloads by analyzing enriched user log data collected from Swiss data archive. The findings show that quantitative datasets are downloaded increasingly from the digital archive and that downloads focus heavily on a small share of the datasets.

The most frequently downloaded datasets are survey datasets collected by research organizations offering possibilities for longitudinal studies. Users typically download only one dataset, but a group of heavy downloaders form a remarkable share of all downloads. The main user group downloading data from the archive are students who use the data in their studies. Furthermore, datasets downloaded for research purposes often, but not always, serve to be used in scholarly publications.

Enriched log data from data archives offer an interesting macro level perspective on the use and users of the services and help understanding the increasing role of repositories in the social sciences. The study provides insights into the potential of collecting and using log data for studying and evaluating data archive use.

URL : Re-use of research data in the social sciences. Use and users of digital data archive

DOI : https://doi.org/10.1371/journal.pone.0303190

Text mining arXiv: a look through quantitative finance papers

Author : Michele Leonardo Bianchi

This paper explores articles hosted on the arXiv preprint server with the aim to uncover valuable insights hidden in this vast collection of research. Employing text mining techniques and through the application of natural language processing methods, we examine the contents of quantitative finance papers posted in arXiv from 1997 to 2022.

We extract and analyze crucial information from the entire documents, including the references, to understand the topics trends over time and to find out the most cited researchers and journals on this domain. Additionally, we compare numerous algorithms to perform topic modeling, including state-of-the-art approaches.

Arxiv : https://arxiv.org/abs/2401.01751

Data journals: incentivizing data access and documentation within the scholarly communication system

Author : William H. Walters

Data journals provide strong incentives for data creators to verify, document and disseminate their data. They also bring data access and documentation into the mainstream of scholarly communication, rewarding data creators through existing mechanisms of peer-reviewed publication and citation tracking.

These same advantages are not generally associated with data repositories, or with conventional journals’ data-sharing mandates. This article describes the unique advantages of data journals.

It also examines the data journal landscape, presenting the characteristics of 13 data journals in the fields of biology, environmental science, chemistry, medicine and health sciences.

These journals vary considerably in size, scope, publisher characteristics, length of data reports, data hosting policies, time from submission to first decision, article processing charges, bibliographic index coverage and citation impact.

They are similar, however, in their peer review criteria, their open access license terms and the characteristics of their editorial boards.

URL : Data journals: incentivizing data access and documentation within the scholarly communication system

DOI : http://doi.org/10.1629/uksg.510