Funding models for Open Access digital data repositories

Purpose

The purpose of this paper is to examine funding models for Open Access (OA) digital data repositories whose costs are not wholly core funded. Whilst such repositories are free to access, they are not without significant cost to build and maintain, and the lack of both full core costs and a direct funding stream through payment-for-use poses a considerable financial challenge, placing their future and the digital collections they hold at risk.

Design/methodology/approach

The authors document 14 different potential funding streams for OA digital data repositories, grouped into six classes (institutional, philanthropy, research, audience, service, volunteer), drawing on the ongoing experiences of seeking sustainable funding for the Digital Repository of Ireland (DRI).

Findings

There is no straightforward solution to funding OA digital data repositories that are not wholly core funded, with a number of general and specific challenges facing each repository, and each funding model having strengths and weaknesses. The proposed DRI solution is the adoption of a blended approach that seeks to ameliorate cyclical effects across funding streams by generating income from a number of sources rather than overly relying on a single one, though it is still reliant on significant state core funding to be viable.

Practical implications

The detailing of potential funding streams offers practical financial solutions to other OA digital data repositories which are seeking a means to become financially sustainable in the absence of full core funding.

Originality/value

The review assesses and provides concrete advice with respect to potential funding streams in order to help repository owners address the financing conundrum they face.

URL : http://dx.doi.org/10.1108/OIR-01-2015-0031

A Platform for Closing the Open Data Feedback Loop Based on Web2.0 Functionality

“One essential characteristic of open data ecosystems is their development through feedback loops, discussions and dynamic data suppliers – user interactions. These user-centric features communicate the users’ needs to the open data community, as well as to the public sector organizations responsible for data publication. Addressing these needs by the corresponding public sector organizations, or even by utilising the power of the community as ENGAGE supports, can significantly promote and accelerate innovation. However, such elements appear barely to be part of existing open data practices in the public sector. A survey we conducted has shown that professional open data users find the feedback and discussion on open data infrastructures from their users to their providers as highly useful and important, but they state that they do not know at least one open data infrastructure that provides various types of discussion and feedback mechanisms.

In this paper we describe and discuss an open data platform, which contributes to filling this gap, and also present a usage scenario of it, explaining the sequence of using its functionality. The discussed open data infrastructure combines functionalities that aim to close the feedback loop and to return information to public authorities that can be useful for better government data opening and publication, as well as establishing communication channels between all stakeholders. This may effectively lead to the stimulation and facilitation of value generation from open data, as such functionality positions the user at the centre of the open data publication process.”

URL : A Platform for Closing the Open Data Feedback Loop Based on Web2.0 Functionality

Alternative URL : http://www.jedem.org/article/view/327/270

Issues in the development of open access to research data

This paper explores key issues in the development of open access to research data. The use of digital means for developing, storing and manipulating data is creating a focus on ‘data-driven science’. One aspect of this focus is the development of ‘open access’ to research data.

Open access to research data refers to the way in which various types of data are openly available to public and private stakeholders, user communities and citizens. Open access to research data, however, involves more than simply providing easier and wider access to data for potential user groups. The development of open access requires attention to the ways data are considered in different areas of research.

We identify how open access is being unevenly developed across the research environment and the consequences this has in terms of generating data gaps. Data gaps refer to the way data becomes detached from published conclusions. To address these issues, we examine four main areas in developing open access to research data: stakeholder roles and values; technological requirements for managing and sharing data; legal and ethical regulations and procedures; institutional roles and policy frameworks.

We conclude that problems of variability and consistency across the open access ecosystem need to be addressed within and between these areas to ensure that risks surrounding a data gap are managed in open access.

URL : http://dx.doi.org/10.1080/08109028.2014.956505

Publishing the British National Bibliography as Linked Open Data

“This paper describes the development of a linked data instance of the British National Bibliography (BNB) by the British Library. The focus is on the development of an RDF (Resource Description Framework) data model and the technical process to convert MARC 21 Bibliographic Data to Linked Data using existing resources. BNB was launched as linked open data in 2011 on a Talis platform. In 2013 it was migrated to a new platform, hosted by TSO. The paper discusses issues arising from the development, implementation and running of a linked data service. It also looks ahead to plans for future developments.”

URL : Publishing the British National Bibliography as Linked Open Data

Alternative URL : http://www.bl.uk/bibliographic/pdfs/publishing_bnb_as_lod.pdf

The Dawn of Open Access to Phylogenetic Data

“The scientific enterprise depends critically on the preservation of and open access to published data. This basic tenet applies acutely to phylogenies (estimates of evolutionary relationships among species). Increasingly, phylogenies are estimated from increasingly large, genome-scale datasets using increasingly complex statistical methods that require increasing levels of expertise and computational investment. Moreover, the resulting phylogenetic data provide an explicit historical perspective that critically informs research in a vast and growing number of scientific disciplines. One such use is the study of changes in rates of lineage diversification (speciation – extinction) through time. As part of a meta-analysis in this area, we sought to collect phylogenetic data (comprising nucleotide sequence alignment and tree files) from 217 studies published in 46 journals over a 13-year period. We document our attempts to procure those data (from online archives and by direct request to corresponding authors), and report results of analyses (using Bayesian logistic regression) to assess the impact of various factors on the success of our efforts. Overall, complete phylogenetic data for of these studies are effectively lost to science. Our study indicates that phylogenetic data are more likely to be deposited in online archives and/or shared upon request when: (1) the publishing journal has a strong data-sharing policy; (2) the publishing journal has a higher impact factor, and; (3) the data are requested from faculty rather than students. Importantly, our survey spans recent policy initiatives and infrastructural changes; our analyses indicate that the positive impact of these community initiatives has been both dramatic and immediate. Although the results of our study indicate that the situation is dire, our findings also reveal tremendous recent progress in the sharing and preservation of phylogenetic data.”

URL : The Dawn of Open Access to Phylogenetic Data

DOI: 10.1371/journal.pone.0110268

7R Data Value Framework for Open Data in Practice: Fusepool

“Based on existing literature, this article makes a case for open (government) data as supporting political efficiency, socio-economic innovation and administrative efficiency, but also finds a lack of measurable impact. It attributes the lack of impact to shortcomings regarding data access (must be efficient) and data usefulness (must be effective). To address these shortcomings, seven key activities that add value to data are identified and are combined into the 7R Data Value Framework, which is an applied methodology for linked data to systematically address both technical and social shortcomings. The 7R Data Value Framework is then applied to the international Fusepool project that develops a set of integrated software components to ease the publishing of open data based on linked data and associated best practices. Real-life applications for the Dutch Parliament and the Libraries of Free University of Berlin are presented, followed by a concluding discussion.”

URL: 7R Data Value Framework for Open Data in Practice: Fusepool

Alternative URL: http://www.mdpi.com/1999-5903/6/3/556