Open Data, Open Society. A research project about openness of public data in EU local administration

This report discusses the current and potential role, in a truly open society, of raw Public Sector Information (PSI) that is really open, that is fully accessible and reusable by everybody. The general characteristics of PSI and the conclusions are based on previous studies and on the analysis of current examples both from the European Union and the rest of the world.

Generation, management and usage of data constituting what is normally called PSI is a very large topic. This report only focuses on some parts of it. First of all, we only look here at really “public” PSI, that is information (from maps to aggregate health data) that is not tied to any single individual and whose publication, therefore, raises no privacy issues.

It is also important to distinguish between actual raw data (basic elements of information like numbers, names, dates, single geographical features like the shape of a lake, addresses…), their results (more or less complex documents, policies, laws…) and the procedures and chains of command followed to generate and use such results, that is to vote or, inside Public Administrations, to take or implement decisions.

So far, discussion and research on Open Data at the national level have had relatively more coverage, even though much of the PSI that has the most direct impact on the life of most citizens is generated, managed and used by local, not central, administrations and end users (citizens, businesses or other organizations). Creation of wealth and jobs can be easier, faster and cheaper to stimulate, especially in times of economic crisis, at the local level. Finally, open access to public data is much more necessary for small businesses than for big corporations, since the latter can afford to pay for access to data anyway (and high prices of data may also protect them from competition from smaller companies).

For all these reasons, the main focus of this report will be on the raw data that constitute “public” PSI as defined above. This is the reason why in this report the terms “raw data” and “PSI” are practically interchangeable. We will also focus on the local dimension of Open PSI, that is raw data directly produced by, or directly relevant for, local communities (Cities and Regions), and on their direct impact on local government and local economy.

Chapters 2 and 3 summarize the importance of data in the modern society and some recent developments on the Open Data front in Europe. Chapter 4 explains why raw PSI should be open, while Chapter 5 shows the potential of such data with a few real world examples from several (mostly EU) countries. Chapter 6 looks at some dangers that should not be ignored when promoting Open Data and Chapter 7 proposes some general practices to follow for getting the most out of them. Some conclusions and the next phases of the project are in Chapter 8.

URL : http://www.dime-eu.org/files/active/0/ODOS_report_1.pdf

An Open Access Future? Report from the eurocancercoms project

“In 2010, the European Association for Cancer Research (EACR) as a member of the Eurocancercoms FP7 project conducted a survey on professional communication activities across its European membership with particular reference to the use of the internet and barriers to communication. Over half of the survey respondents were working in basic cancer research, a further third in translational research and the remaining respondents in epidemiology or medical oncology. From a range of interesting information and opinions, the survey revealed that the internet is used by 94% of cancer researchers for professional activities every day with the majority accessing PubMed and online journals daily or 2-3 times a week. These simple statistics place access to published research findings online at the centre of support for cancer researchers’ work: a crucial sharing of information which can accelerate progress in the scientific battle with cancer.

While the survey had not focussed on Open Access specifically, comment banks and discussions at consensus meetings following the publication of the survey results highlighted the issue of access to subscription journals, the barrier to essential and urgent information that a ‘paywall’ creates, and the need for free access. A second survey picking up on the issue of Open Access publishing has now been completed. This paper shares the results of that survey, which was again conducted across the European membership of EACR, and cross references responses with selected data from the Study of Open Access Publishing (SOAP) 2011 which was undertaken across all academic disciplines. The SOAP data is freely accessible and can be mined for information by anyone who wishes to use it. A number of questions were included in the survey that mirrored those used by SOAP, allowing the direct comparison of results. In this article a comparison has been made between the responses provided by cancer researchers and the 7,433 respondents to the SOAP survey from the Biological Sciences. (Over 43,000 responses were received across all disciplines to the SOAP survey.)

As a conclusion and invitation to further discussion, this paper also contributes to the debate around subscription and Open Access publishing, supporting the case for accelerating the progress towards Open Access publishing of cancer research articles as a particularly supportive way of assisting all researchers to make unhindered progress with their work.”

URL : http://www.eacr.org/about/20110820_Open%20Access%20Future.pdf

The internet and science communication: blurring the boundaries

“Scientific research is heavily dependent on communication and collaboration. Research does not exist in a bubble; scientific work must be communicated in order to add it to the body of knowledge within a scientific community, so that its members may ‘stand on the shoulders of giants’ and benefit from all that has come before. The effectiveness of scientific communication is crucial to the pace of scientific progress: in all its forms it enables ideas to be formulated, results to be compared, and replications and improvements to be made. The sharing of science is a foundational aspect of the scientific method. This paper, part of the policy research within the FP7 EUROCANCERCOMS project, discusses how the Internet has changed communication by cancer researchers and how it has the potential to change it still more in the future. It will detail two broad types of communication: formal and informal, and how these are changing with the use of new web tools and technologies.”

URL : http://digital-scholarship.org/digitalkoans/2011/08/29/an-open-access-future-report-from-the-eurocancercoms-project/

Open access to scientific knowledge and feudalism knowledge: Is there a connection?

The role of universities and transnational corporations in the circulation of scientific knowledge is considered. While institutions generate mostly scientific knowledge and try to facilitate its free circulation, transnational companies, by contrast, try to remove the most significant and cutting-edge scientific knowledge from free circulation, commercialize it, and reintroduce it into an open, but now commercial, circulation under TRIPS.

Paradoxically, however, the open access movement eventually facilitates a feudalism of knowledge. We call this phenomenon the ‘open access – paradox’. Based on experiments done with Google Scholar and Google Patents, it is shown that universities generate mostly scientific knowledge (scientific articles), and transnational companies generate mostly technological knowledge (patents).

URL : http://www.webology.org/2011/v8n1/a83.html

Wikis in scholarly publishing

“Scientific research is a process concerned with the creation, collective accumulation, contextualization, updating and maintenance of knowledge. Wikis provide an environment that allows to collectively accumulate, contextualize, update and maintain knowledge in a coherent and transparent fashion. Here, we examine the potential of wikis as platforms for scholarly publishing. In the hope to stimulate further discussion, the article itself was drafted on Species ID – http://species-id.net; a wiki that hosts a prototype for wiki-based scholarly publishing – where it can be updated, expanded or otherwise improved.”

URL : http://iospress.metapress.com/content/q42617538838t6j2/

Open access journals – what publishers offer, what researchers want

“The SOAP (Study of Open Access Publishing) project has analyzed the current supply and demand situation in the open access journal landscape. Starting from the Directory of Open Access Journals, several sources of data were considered, including journal websites and direct inquiries within the publishing industry to comprehensively map the present supply of online peer-reviewed OA journals. The demand for open access publishing is summarised, as assessed through a large-scale survey of researchers’ opinions and attitudes. Some forty thousand answers were collected across disciplines and around the world, reflecting major support for the idea of open access, while highlighting drivers of and barriers to open access publishing.”

URL : http://iospress.metapress.com/content/2j27028w417x60v8/

Extracting, Transforming and Archiving Scientific Data

“It is becoming common to archive research datasets that are not only large but also numerous. In addition, their corresponding metadata and the software required to analyse or display them need to be archived. Yet the manual curation of research data can be difficult and expensive, particularly in very large digital repositories, hence the importance of models and tools for automating digital curation tasks. The automation of these tasks faces three major challenges: (1) research data and data sources are highly heterogeneous, (2) future research needs are difficult to anticipate, (3) data is hard to index. To address these problems, we propose the Extract, Transform and Archive (ETA) model for managing and mechanizing the curation of research data. Specifically, we propose a scalable strategy for addressing the research-data problem, ranging from the extraction of legacy data to its long-term storage. We review some existing solutions and propose novel avenues of research.”

URL : http://arxiv.org/abs/1108.4041