Réflexions sur le fragment dans les pratiques scientifiques en ligne : entre matérialité documentaire et péricope

Auteurs/Authors : Gérald Kembellec, Thomas Bottini

Cette communication propose une réflexion pluridisciplinaire (SIC, ingénierie documentaire et théorie du document numérique, informatique, « humanités numériques », histoire des pratiques savantes) sur les usages du fragment dans les pratiques documentaires scientifiques en ligne.

En prolongement de ces éléments théoriques sont proposés un modèle théorique de la segmentation des contenus en unités de sens (péricope) et des directions d’implémentation.

URL : https://hal-univ-paris10.archives-ouvertes.fr/hal-01700064

Studying Conceptual Models for Publishing Library Data to the Semantic Web

Author : Sofia Zapounidou

This thesis studies the library data and the way that linked data technologies may affect libraries. The thesis aims to contribute to the research regarding the devel-opment and implementation of a framework for the integration of bibliographic data in the semantic web.

It seeks to make sound propositions for the interopera-bility of conceptual bibliographic models, as well as for future library systems and search environments integrating bibliographic information.

DOI : http://eprints.rclis.org/32108/

Biotea: semantics for Pubmed Central

Authors : Alexander Garcia​, Federico Lopez, Leyla Garcia, Olga Giraldo, Victor Bucheli, Michel Dumontier

A significant portion of biomedical literature is represented in a manner that makes it difficult for consumers to find or aggregate content through a computational query. One approach to facilitate reuse of the scientific literature is to structure this information as linked data using standardized web technologies.

In this paper we present the second version of Biotea, a semantic, linked data version of the open-access subset of PubMed Central that has been enhanced with specialized annotation pipelines that uses existing infrastructure from the National Center for Biomedical Ontology.

We expose our models, services, software and datasets. Our infrastructure enables manual and semi-automatic annotation, resulting data are represented as RDF-based linked data and can be readily queried using the SPARQL query language.

We illustrate the utility of our system with several use cases. Our datasets, methods and techniques are available at http://biotea.github.io.

URL : Biotea: semantics for Pubmed Central

DOI : https://doi.org/10.7717/peerj.4201

Open data, [open] access: linking data sharing and article sharing in the Earth Sciences

Author : Samantha Teplitzky

Introduction

The norms of a research community influence practice, and norms of openness and sharing can be shaped to encourage researchers who share in one aspect of their research cycle to share in another.

Different sets of mandates have evolved to require that research data be made public, but not necessarily articles resulting from that collected data. In this paper, I ask to what extent publications in the Earth Sciences are more likely to be open access (in all of its definitions) when researchers open their data through the Pangaea repository.

Methods

Citations from Pangaea data sets were studied to determine the level of open access for each article.

Results

This study finds that the proportion of gold open access articles linked to the repository increased 25% from 2010 to 2015 and 75% of articles were available from multiple open sources.

Discussion

The context for increased preference for gold open access is considered and future work linking researchers’ decisions to open their work to the adoption of open access mandates is proposed.

URL : Open data, [open] access: linking data sharing and article sharing in the Earth Sciences

DOI : http://doi.org/10.7710/2162-3309.2150

Linked Data is People: Building a Knowledge Graph to Reshape the Library Staff Directory

Authors : Jason A. Clark, Scott W. H. Young

One of our greatest library resources is people. Most libraries have staff directory information published on the web, yet most of this data is trapped in local silos, PDFs, or unstructured HTML markup.

With this in mind, the library informatics team at Montana State University (MSU) Library set a goal of remaking our people pages by connecting the local staff database to the Linked Open Data (LOD) cloud.

In pursuing linked data integration for library staff profiles, we have realized two primary use cases: improving the search engine optimization (SEO) for people pages and creating network graph visualizations.

In this article, we will focus on the code to build this library graph model as well as the linked data workflows and ontology expressions developed to support it. Existing linked data work has largely centered around machine-actionable data and improvements for bots or intelligent software agents.

Our work demonstrates that connecting your staff directory to the LOD cloud can reveal relationships among people in dynamic ways, thereby raising staff visibility and bringing an increased level of understanding and collaboration potential for one of our primary assets: the people that make the library happen.

URL : http://journal.code4lib.org/articles/12320

Decentralized provenance-aware publishing with nanopublications

Authors : Tobias Kuhn, Christine Chichester, Michael Krauthammer, Núria Queralt-Rosinach, Ruben Verborgh, George Giannakopoulos, Axel-Cyrille Ngonga Ngomo, Raffaele Viglianti, Michel Dumontier

Publication and archival of scientific results is still commonly considered the responsability of classical publishing companies. Classical forms of publishing, however, which center around printed narrative articles, no longer seem well-suited in the digital age.

In particular, there exist currently no efficient, reliable, and agreed-upon methods for publishing scientific datasets, which have become increasingly important for science. In this article, we propose to design scientific data publishing as a web-based bottom-up process, without top-down control of central authorities such as publishing companies.

Based on a novel combination of existing concepts and technologies, we present a server network to decentrally store and archive data in the form of nanopublications, an RDF-based format to represent scientific data.

We show how this approach allows researchers to publish, retrieve, verify, and recombine datasets of nanopublications in a reliable and trustworthy manner, and we argue that this architecture could be used as a low-level data publication layer to serve the Semantic Web in general.

Our evaluation of the current network shows that this system is efficient and reliable.

URL : Decentralized provenance-aware publishing with nanopublications

DOI : https://doi.org/10.7717/peerj-cs.78

The Paradox of Privacy: Revisiting a Core Library Value in an Age of Big Data and Linked Data

Protecting user privacy and confidentiality is fundamental to the ethics and practice of librarianship, and such protection constitutes one of eleven values in the American Library Association’s “Core Values of Librarianship” (2004).

This paper addresses the concerns of protecting privacy in the library as they relate to library users who are defining, exploring, and negotiating their sexual identities with the help of the library’s information, programming, and physical facilities.

In so doing, we enlist the aid of Garret Keizer, who, in Privacy (2012), articulates a fresh theory of the concept in light of American social life in the twenty-first century. Using Keizer’s theory, we examine these concerns within the context of the rise of big data systems and social media on the one hand, and linked data and new cataloging standards on the other.

In so doing, we suggest that linked data technologies, with their ability to lead searchers through self-directed, open inquiry, are superior to big data technologies in the navigation of the paradox between openness and secrecy.

In this way they offer a greater potential to support the needs of queer library users: lesbian, gay, bisexual, transgendered, or questioning (LGBTQ).

URL : http://muse.jhu.edu/journals/library_trends/v064/64.3.campbell.html