DeepGreen—A Data Hub for the Distribution of Scholarly Articles From Publishers to Open Access Repositories in Germany

Authors: Tomasz StomporHeinz PampelJulia Boltze-FüttererBeate Rusch

  • DeepGreen is an automated delivery service for open access articles. Originally conceived to take advantage of the so-called open access component—a secondary publication right in Alliance and National licences in Germany to promote green open access—it aims to streamline open access processes by automating the distribution of full-text articles and metadata from publishers to repositories.
  • The service, developed by a consortium and funded by the German Research Foundation (DFG) in its initial phase, has successfully established itself as a national service, facilitating open access content distribution and contributing to Germany’s open access infrastructure.
  • As of December 2024, DeepGreen distributes articles from 14 publishers to 84 institutional repositories and 6 subject-specific repositories.
  • This article describes the role of the DeepGreen service in Germany, its collaboration with publishers and the potential of automated processes for storing articles in open access repositories, which, as publicly owned institutional infrastructures, ensure sustainable access and provide secure, redundant storage.

URL : DeepGreen—A Data Hub for the Distribution of Scholarly Articles From Publishers to Open Access Repositories in Germany

DOI : https://doi.org/10.1002/leap.70000

Open science in Spain: Influence of personal and contextual factors on deposit patterns

Author

Background

This study investigates factors influencing the deposit of academic publications and research data in open access repositories by Spanish researchers.

Methods

Using survey data from a sample of Spanish academics, the research examines the impact of personal attributes (e.g., gender, age, knowledge of open science) and contextual variables (e.g., academic discipline, institutional type) on deposit behaviours. Quantitative methods, including chi-square tests and regression analysis, reveal significant associations between knowledge of open science and deposit practices.

Results

Researchers familiar with open science principles were more likely to deposit multiple versions of articles and datasets, albeit with varying intensity. Key findings highlight disciplinary and institutional differences: researchers in Life Sciences and Experimental Sciences showed higher engagement with both article and data deposits, whereas Health Sciences lagged. Gender differences were also observed, with male researchers depositing articles and datasets more frequently than their female counterparts, though age showed limited impact. Public institutions exhibited lower data deposit rates despite mandates supporting open access.

Conclusions

The study underscores the need for tailored policies, including awareness campaigns, infrastructure investment, and discipline-specific strategies, to promote equitable and widespread adoption of open science practices. Findings contribute to understanding open science implementation, emphasizing the interplay of individual, institutional, and systemic factors.

URL : Open science in Spain: Influence of personal and contextual factors on deposit patterns

DOI : https://doi.org/10.12688/f1000research.160207.1

 

Moving Open Repositories out of the Blind Spot of Initiatives to Correct the Scholarly Record

Author : Frédérique Bordignon

Open repositories were created to enhance access and visibility of scholarly publications, driven by open science ideals emphasising transparency and accessibility. However, they lack mechanisms to update the status of corrected or retracted publications, posing a threat to the integrity of the scholarly record. To explore the scope of the problem, a manually verified corpus was examined: we extracted all the entries in the Crossref × Retraction Watch database for which the publication date of the corrected or retracted document ranged from 2013 to 2023.

This corresponded to 24,430 entries with a DOI, which we use to query Unpaywall and identify their possible indexing in HAL, an open repository (second largest institutional repository worldwide). In most cases (91%), HAL does not mention corrections. While the study needs broader scope, it highlights the necessity of improving the role of open repositories in correction processes with better curation practices.

We discuss how harvesting operations and the interoperability of platforms can maintain the integrity of the entire scholarly record. Not only will the open repositories avoid damaging its reliability through ambiguous reporting, but on the contrary, they will also strengthen it.

URL : Moving Open Repositories out of the Blind Spot of Initiatives to Correct the Scholarly Record

DOI : https://doi.org/10.1002/leap.1655

Open But Hidden: Open Access Gaps in the National Science Foundation Public Access Repository

Authors : Kimberly R. Powell, Jenny Townes, Fred Rascoe

Introduction

In 2022, the U.S. government released new guidelines for making publicly funded research open and available. For the National Science Foundation (NSF), these policies reinforce requirements in place since 2016 for supported research to be submitted to the Public Access Repository (PAR).

Methods

To evaluate the public access compliance of research articles submitted to the NSF-PAR, this study searched for NSF-PAR records published between 2017 and 2021 from two research intensive institutions. Records were reviewed to determine whether the PAR held a deposited copy, as required by the 2016 policies, or provided a link out to publisher-held version(s).

Results

A total of 841 unique records were identified, all with publicly accessible versions. Yet only 42% had a deposited PDF version available in the repository as required by the NSF 2016 Public Access Policy. The remaining 58% directed instead to publisher-held versions. In total, only 55% of record links labeled “Full Text Available” directed users to a publicly accessible version with a single click.

Discussion

Records within PAR do not clearly direct users to the publicly accessible full text. In almost half of records, the most prominently displayed link directed users to a paywall version, even when a publicly available version existed. Records accessible only through the CHORUS (Clearing House for the Open Research of the United States) initiative were further obscured by requiring specialized navigation of publisher-owned sites.

Conclusion:

Despite having a repository mandate since 2016, NSF compliance rates remain low. Additional support and/or oversight is needed to address the additional requirements introduced under the 2022 memo.

URL : Open But Hidden: Open Access Gaps in the National Science Foundation Public Access Repository

DOI : https://doi.org/10.31274/jlsc.17767

 

Open Research Data in Spanish University Repositories

Authors : Pablo Monteagudo-Haro, Juan Jose Prieto-Gutierrez

The current situation of open research data in Spanish university repositories is analyzed by means of twelve indicators that allow us to compare them with each other. The twelve self-developed indicators deal with research datasets and institutional policies linked to open access, as well as some of the key characteristics of the repositories.

The methodology used consists of comparing the repositories of the different universities linked to REBIUN. The result has been that datasets in institutional repositories are scarce, and the situation is heterogeneous across the territory. This raises questions about future open access policies for research data in the country’s main scientific institutions.

Arxiv : https://arxiv.org/abs/2411.10470

Closing the Loop: Automating Links to Open Access Versions of Articles in Faculty Profiles at a Large Research Institution

Authors : Andrew M. Johnson, Don Elsborg

Introduction

Universities implement faculty profile systems for a variety of reasons, including to increase visibility of research produced at the institution. These profiles often connect with other campus systems, particularly institutional repositories.

This article describes a collaboration at the University of Colorado Boulder aimed at integrating the institutional repository with the faculty profile system, which then expanded to encompass the automated creation of profile links to open access versions of faculty articles from any journal or repository.

Description of Program

To achieve the initial project goals, a cross-campus team from the University Libraries and the Faculty Information System developed a strategy of using Unpaywall as an intermediary data source to connect the institutional repository with the faculty profile system.

This also allowed for the development of an automated process for generating links to open access content from any journal or repository, which resulted in the creation of over 35,000 links to openly available content in faculty profiles, including over 2,900 links to content in the institutional repository.

These links provide public users of the faculty profile system with a simple way to access all openly available research produced at the university. This article describes the development and implementation of this project as well as lessons learned.

Next Steps

The ongoing collaboration provides additional opportunities to unlock data for monitoring rates of open access publishing and self-archiving, informing library collection development decisions, and connecting to other data sources to reveal further insights.

URL : Closing the Loop: Automating Links to Open Access Versions of Articles in Faculty Profiles at a Large Research Institution

DOI : https://doi.org/10.31274/jlsc.17242