Open Access — Towards a non-normative and systematic understanding

Authors : Niels Taubert, Anne Hobert, Nicolas Fraser, Najko Jahn, Elham Iravani

The term Open Access not only describes a certain model of scholarly publishing — namely in digital format freely accessible to readers — but often also implies that free availability of research results is desirable, and hence has a normative character.

Together with the large variety of presently used definitions of different Open Access types, this normativity hinders a systematic investigation of the development of open availability of scholarly literature.

In this paper, we propose a non-normative definition of Open Access and its usage as a neutral, descriptive term in bibliometric studies and research on science.

To this end, we first specify what normative figures are commonly associated with the term Open Access and then develop a neutral definition. We further identify distinguishing characteristics of openly accessible literature, called dimensions, and derive a classification scheme into Open Access categories based on these dimensions.

Additionally, we present an operationalisation method to assign scientific publications to the respective categories in practice. Here, we describe useful data sources, which can be employed to gather the information needed for the classification of scholarly works according to the presented classification scheme.

URL : https://arxiv.org/abs/1910.11568

Different Preservation Levels: The Case of Scholarly Digital Editions

Authors : Elias Oltmanns, Tim Hasler, Wolfgang Peters-Kottig, Heinz-Günter Kuper

Ensuring the long-term availability of research data forms an integral part of data management services. Where OAIS compliant digital preservation has been established in recent years, in almost all cases the services aim at the preservation of file-based objects.

In the Digital Humanities, research data is often represented in highly structured aggregations, such as Scholarly Digital Editions. Naturally, scholars would like their editions to remain functionally complete as long as possible.

Besides standard components like webservers, the presentation typically relies on project specific code interacting with client software like webbrowsers. Especially the latter being subject to rapid change over time invariably makes such environments awkward to maintain once funding has ended.

Pragmatic approaches have to be found in order to balance the curation effort and the maintainability of access to research data over time. A sketch of four potential service levels aiming at the long-term availability of research data in the humanities is outlined: (1) Continuous Maintenance, (2) Application Conservation, (3) Application Data Preservation, and (4) Bitstream Preservation.

The first being too costly and the last hardly satisfactory in general, we suggest that the implementation of services by an infrastructure provider should concentrate on service levels 2 and 3. We explain their strengths and limitations considering the example of two Scholarly Digital Editions.

URL : Different Preservation Levels: The Case of Scholarly Digital Editions

DOI : http://doi.org/10.5334/dsj-2019-051

From Academia to Software Development: Publication Citations in Source Code Comments

Authors : Akira Inokuchi, Yusuf Sulistyo Nugroho, Fumiaki Konishi, Hideaki Hata, Akito Monden, Kenichi Matsumoto

Academic publications have been evaluated with the impact on research communities based on the number of citations. On the other hand, the impact of academic publications on industry has been rarely studied.

This paper investigates how academic publications contribute to software development by analyzing publication citations in source code comments in open source software repositories.

We propose an automated approach of detecting academic publications based on Named Entity Recognition, and achieve 0.90 in F1 as detection accuracy. We conduct a large-scale study of publication citations with 319,438,977 comments collected from active 25,925 repositories written in seven programming languages.

Our findings indicate that academic publications can be knowledge sources of software development, and there can be potential issues of obsoleting knowledge.

URL : https://arxiv.org/abs/1910.06932

The diverse niches of megajournals: Specialism within generalism

Authors: Kyle Siler, Vincent Larivière, Cassidy R. Sugimoto

Over the past decade, megajournals have expanded in popularity and established a legitimate niche in academic publishing. Leveraging advantages of digital publishing, megajournals are characterized by large publication volume, broad interdisciplinary scope, and peer‐review filters that select primarily for scientific soundness as opposed to novelty or originality.

These publishing innovations are complementary and competitive vis‐à‐vis traditional journals. We analyze how megajournals (PLOS One, Scientific Reports) are represented in different fields relative to prominent generalist journals (Nature, PNAS, Science) and “quasi‐megajournals” (Nature Communications, PeerJ).

Our results show that both megajournals and prominent traditional journals have distinctive niches, despite the similar interdisciplinary scopes of such journals.

These niches—defined by publishing volume and disciplinary diversity—are dynamic and varied over the relatively brief histories of the analyzed megajournals. Although the life sciences are the predominant contributor to megajournals, there is variation in the disciplinary composition of different megajournals.

The growth trajectories and disciplinary composition of generalist journals—including megajournals—reflect changing knowledge dissemination and reward structures in science.

URL : The diverse niches of megajournals: Specialism within generalism

DOI : https://doi.org/10.1002/asi.24299

Revisiting “the 1990s debutante”: Scholar‐led publishing and the prehistory of the open access movement

Author : Samuel A. Moore

The movement for open access publishing (OA) is often said to have its roots in the scientific disciplines, having been popularized by scientific publishers and formalized through a range of top‐down policy interventions. But there is an often‐neglected prehistory of OA that can be found in the early DIY publishers of the late 1980s and early 1990s.

Managed entirely by working academics, these journals published research in the humanities and social sciences and stand out for their unique set of motivations and practices.

This article explores this separate lineage in the history of the OA movement through a critical‐theoretical analysis of the motivations and practices of the early scholar‐led publishers.

Alongside showing the involvement of the humanities and social sciences in the formation of OA, the analysis reveals the importance that these journals placed on experimental practices, critique of commercial publishing, and the desire to reach new audiences.

Understood in today’s context, this research is significant for adding complexity to the history of OA, which policymakers, advocates, and publishing scholars should keep in mind as OA goes mainstream.

DOI : https://doi.org/10.1002/asi.24306

The Future of OA: A large-scale analysis projecting Open Access publication and readership

Authors : Heather Piwowar, Jason Priem, Richard Orr

Understanding the growth of open access (OA) is important for deciding funder policy, subscription allocation, and infrastructure planning.

This study analyses the number of papers available as OA over time. The models includes both OA embargo data and the relative growth rates of different OA types over time, based on the OA status of 70 million journal articles published between 1950 and 2019.

The study also looks at article usage data, analyzing the proportion of views to OA articles vs views to articles which are closed access. Signal processing techniques are used to model how these viewership patterns change over time. Viewership data is based on 2.8 million uses of the Unpaywall browser extension in July 2019.

We found that Green, Gold, and Hybrid papers receive more views than their Closed or Bronze counterparts, particularly Green papers made available within a year of publication. We also found that the proportion of Green, Gold, and Hybrid articles is growing most quickly.

In 2019:

  • 31% of all journal articles are available as OA
  • 52% of article views are to OA article

Given existing trends, we estimate that by 2025:

  • 44% of all journal articles will be available as OA

  • 70% of article views will be to OA articles

The declining relevance of closed access articles is likely to change the landscape of scholarly communication in the years to come.

URL : The Future of OA: A large-scale analysis projecting Open Access publication and readership

DOI : https://doi.org/10.1101/795310

Data papers as a new form of knowledge organization in the field of research data

Authors : Joachim Schöpfel, Dominic Farace, Hélène Prost, Antonella Zane

Data papers have been defined as scholarly journal publications whose primary purpose is to describe research data. Our survey provides more insights about the environment of data papers, i.e. disciplines, publishers and business models, and about their structure, length, formats, metadata and licensing.

Data papers are a product of the emerging ecosystem of data-driven open science. They contribute to the FAIR principles for research data management. However, the boundaries with other categories of academic publishing are partly blurred. Data papers are (can be) generated automatically and are potentially machine-readable.

Data papers are essentially information, i.e. description of data, but also partly contribute to the generation of knowledge and data on its own. Part of the new ecosystem of open and data-driven science, data papers and data journals are an interesting and relevant object for the assessment and understanding of the transition of the former system of academic publishing.

URL : https://halshs.archives-ouvertes.fr/ISKOFRANCE2019/halshs-02284548