Challenges and opportunities in the evolving digital preservation landscape: reflections from Portico

Authors: Kate Wittenberg, Sarah Glasser, Amy Kirchhoff, Sheila Morrissey, Stephanie Orphan

There has been tremendous growth in the amount of digital content created by libraries, publishers, cultural institutions and the general public. While there are great benefits to having content available in digital form, digital objects can be extremely short-lived unless proper attention is paid to preservation.

Reflecting on our experience with the digital preservation service Portico, we provide background on Portico’s history and evolving practice of sustainable preservation of the digital artifacts of scholarly communications.

We also provide an overview of the digital preservation landscape as we see it now, with some thoughts on current preservation requirements and on the opportunities and challenges that lie ahead.

URL : Challenges and opportunities in the evolving digital preservation landscape: reflections from Portico

DOI : http://doi.org/10.1629/uksg.421

The Modern Research Data Portal: a design pattern for networked, data-intensive science

Authors : Kyle Chard, Eli Dart, Ian Foster, David Shifflett, Steven Tuecke, Jason Williams

We describe best practices for providing convenient, high-speed, secure access to large data via research data portals. We capture these best practices in a new design pattern, the Modern Research Data Portal, that disaggregates the traditional monolithic web-based data portal to achieve orders-of-magnitude increases in data transfer performance, support new deployment architectures that decouple control logic from data storage, and reduce development and operations costs.

We introduce the design pattern; explain how it leverages high-performance data enclaves and cloud-based data management services; review representative examples at research laboratories and universities, including both experimental facilities and supercomputer sites; describe how to leverage Python APIs for authentication, authorization, data transfer, and data sharing; and use coding examples to demonstrate how these APIs can be used to implement a range of research data portal capabilities.

Sample code at a companion web site, https://docs.globus.org/mrdp, provides application skeletons that readers can adapt to realize their own research data portals.
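The decoupling of control logic from data storage described above can be illustrated with a minimal sketch. This is not the companion site's code and does not use any real service API: the class names (Portal, TransferService, TransferRequest), the endpoint names and the dataset path layout are all hypothetical, chosen only to show a portal that authorizes a request and hands the data movement to a separate service without ever touching the bytes itself.

```python
import itertools
from dataclasses import dataclass
from typing import Dict, Set


@dataclass(frozen=True)
class TransferRequest:
    """Describes a transfer; holds locations only, never the data itself."""
    source_endpoint: str
    destination_endpoint: str
    path: str


class TransferService:
    """Hypothetical stand-in for an external, managed transfer service."""

    def __init__(self) -> None:
        self._ids = itertools.count(1)
        self.tasks: Dict[str, TransferRequest] = {}

    def submit(self, request: TransferRequest) -> str:
        # A real service would move data between endpoints asynchronously;
        # this sketch only records the request and returns a task ID.
        task_id = f"task-{next(self._ids)}"
        self.tasks[task_id] = request
        return task_id


class Portal:
    """Control logic only: checks authorization, then delegates the transfer."""

    def __init__(self, transfer_service: TransferService,
                 data_endpoint: str, acl: Dict[str, Set[str]]) -> None:
        self._transfer = transfer_service
        self._data_endpoint = data_endpoint  # where the data lives (assumed name)
        self._acl = acl                      # user -> datasets the user may read

    def request_dataset(self, user: str, dataset: str,
                        destination_endpoint: str) -> str:
        if dataset not in self._acl.get(user, set()):
            raise PermissionError(f"{user} may not access {dataset}")
        request = TransferRequest(
            source_endpoint=self._data_endpoint,
            destination_endpoint=destination_endpoint,
            path=f"/datasets/{dataset}",  # hypothetical path layout
        )
        # The portal returns a task ID; the data never passes through it.
        return self._transfer.submit(request)


service = TransferService()
portal = Portal(service, "facility-dtn", {"alice": {"climate-2017"}})
print(portal.request_dataset("alice", "climate-2017", "alice-laptop"))  # task-1
```

Because the portal only issues requests and tracks task IDs, the web server and the storage system can in principle be deployed, scaled and secured independently, which is the separation the abstract attributes to the design pattern.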

URL : The Modern Research Data Portal: a design pattern for networked, data-intensive science

DOI : https://doi.org/10.7717/peerj-cs.144

Interoperability and FAIRness through a novel combination of Web technologies

Authors : Mark D. Wilkinson, Ruben Verborgh, Luiz Olavo Bonino da Silva Santos, Tim Clark, Morris A. Swertz, Fleur D.L. Kelpin, Alasdair J.G. Gray, Erik A. Schultes, Erik M. van Mulligen, Paolo Ciccarese, Arnold Kuzniar, Anand Gavai, Mark Thompson, Rajaram Kaliyaperumal, Jerven T. Bolleman, Michel Dumontier

Data in the life sciences are extremely diverse and are stored in a broad spectrum of repositories ranging from those designed for particular data types (such as KEGG for pathway data or UniProt for protein data) to those that are general-purpose (such as FigShare, Zenodo, Dataverse or EUDAT).

These data have widely different levels of sensitivity and security considerations. For example, clinical observations about genetic mutations in patients are highly sensitive, while observations of species diversity are generally not.

The lack of uniformity in data models from one repository to another, and in the richness and availability of metadata descriptions, makes integration and analysis of these data a manual, time-consuming task with no scalability.

Here we explore a set of resource-oriented Web design patterns for data discovery, accessibility, transformation, and integration that can be implemented by any general- or special-purpose repository as a means to assist users in finding and reusing their data holdings.

We show that by using off-the-shelf technologies, interoperability can be achieved at the level of an individual spreadsheet cell. We note that the behaviours of this architecture compare favourably to the desiderata defined by the FAIR Data Principles, and can therefore represent an exemplar implementation of those principles.

The proposed interoperability design patterns may be used to improve discovery and integration of both new and legacy data, maximizing the utility of all scholarly outputs.

URL : Interoperability and FAIRness through a novel combination of Web technologies

DOI : https://doi.org/10.7717/peerj-cs.110

Calenge par Bertrand, parcours de lecture dans le Carnet d’un bibliothécaire : Du blog au book

Author : Jérôme Pouchol

We are all indebted to Bertrand Calenge, a renowned librarian and a theorist and practitioner of libraries, who died in 2016.

A collective of librarians brings this author back to life by offering a reading path through his blog, Carnet de notes.

These thematic, cross-cutting paths recontextualize the blog posts according to the main subjects the author addressed (collections, mediation, evaluation, the profession, digital matters, and so on), which is to say all the live questions facing libraries.

This book experiments with turning a professional’s blog into a book, inviting us, as Martine Poulain writes in her preface, “to think, exchange, propose”.

URL : Calenge par Bertrand, parcours de lecture dans le Carnet d’un bibliothécaire : Du blog au book

Alternative location : http://www.enssib.fr/presses/catalogue/calenge-par-bertrand-parcours-de-lecture-dans-le-carnet-dun-bibliothecaire

Do altmetrics assess societal impact in the same way as case studies? An empirical analysis testing the convergent validity of altmetrics based on data from the UK Research Excellence Framework (REF)

Authors : Lutz Bornmann, Robin Haunschild, Jonathan Adams

Altmetrics have been proposed as a way to assess the societal impact of research. Although altmetrics are already in use as impact or attention metrics in different contexts, it is still not clear whether they really capture or reflect societal impact.

This study is based on altmetrics, citation counts, research output and case study data from the UK Research Excellence Framework (REF), and peers’ REF assessments of research output and societal impact. We investigated the convergent validity of altmetrics by using two REF datasets: publications submitted as research output (PRO) to the REF and publications referenced in case studies (PCS).

Case studies, which are intended to demonstrate societal impact, should cite the most relevant research papers. We used the MHq’ indicator for assessing impact – an indicator which has been introduced for count data with many zeros.

The results of the first part of the analysis show that news media as well as mentions on Facebook, in blogs, in Wikipedia, and in policy-related documents have higher MHq’ values for PCS than for PRO.

Thus, the altmetric indicators seem to have convergent validity for these data. In the second part of the analysis, altmetrics have been correlated with REF reviewers’ average scores on PCS. The negative or close to zero correlations question the convergent validity of altmetrics in that context.

We suggest that they may capture a different aspect of societal impact (which can be called unknown attention) to that seen by reviewers (who are interested in the causal link between research and action in society).

URL : https://arxiv.org/abs/1807.03977

Health science libraries in Sweden: new directions, expanding roles

Authors : Lotta Haglund, Annikki Roos, Petra Wallgren-Björk

Librarians in Sweden are facing huge challenges in meeting the demands of their organisations and users. This article looks at four key areas: coping with open science/open access initiatives; increasing demands from researchers for support doing systematic reviews; understanding user experiences in Swedish health science libraries; and the consequences of expanding roles for recruitment and continuing professional development.

With regard to changing roles, there is an increasing shift from the generalist towards the expert role. The authors raise the issue of how to prepare those new to the profession for the changing environment of health science libraries.

URL : Health science libraries in Sweden: new directions, expanding roles

DOI : https://doi.org/10.1111/hir.12229

Understanding Open Knowledge in China: A Chinese Approach to Openness?

Authors: Lucy Montgomery, Xiang Ren

This paper examines the development of open knowledge in China through two case studies: the development of Chinese open access (OA) journals, and national-level OA repositories.

Open access and open knowledge are emerging as a site of both grass-roots activism and top-down intervention in the practices of scholarship and scholarly publishing in China. Although the language, vision and strategies of the global open knowledge movement are undoubtedly present, so too are the messy realities of open access and open knowledge innovation in a local context.

In attempting to position open access developments in China within a diverse and contested global landscape of open knowledge innovation we draw on Moore’s (2017) conception of open access as a boundary object: an object that is understood differently within individual communities but which maintains enough structure to be understood between communities (Moore 2017; Star and Griesemer 1989).

Viewed as a boundary object, the concept of open knowledge is making it possible for China to engage with the global open knowledge movement, as a beneficiary of the innovation of others, and as an open knowledge innovator in its own right.

URL : Understanding Open Knowledge in China: A Chinese Approach to Openness?

DOI : http://doi.org/10.5334/csci.106