Tag : Linked Data

Linking Data Citation to Repository Visibility: An Empirical Study

Post author : Hans Dillaerts
Post date : June 23, 2025

Authors : Fakhri Momeni, Janete Saldanha Bach, Brigitte Mathiak, Peter Mutschke

In today’s data-driven research landscape, dataset visibility and accessibility play a crucial role in advancing scientific knowledge. At the same time, data citation is essential for maintaining academic integrity, acknowledging contributions, validating research outcomes, and fostering scientific reproducibility.

As a critical link, data citation connects scholarly publications with the datasets that drive scientific progress. This study investigates whether repository visibility influences data citation rates. We hypothesize that repositories with higher visibility, as measured by search engine metrics, are associated with increased dataset citations.

Using OpenAlex data and repository impact indicators (including the visibility index from Sistrix, the h-index of repositories, and citation metrics such as mean and median citations), we analyze datasets in Social Sciences and Economics to explore the relationship between repository visibility and dataset citations. Our findings suggest that datasets hosted on more visible web domains tend to receive more citations, with a positive correlation observed between web domain visibility and dataset citation counts, particularly for datasets with at least one citation. However, when analyzing domain-level citation metrics, such as the h-index, mean, and median citations, the correlations are inconsistent and weaker.

While higher visibility domains tend to host datasets with greater citation impact, the distribution of citations across datasets varies significantly. These results suggest that while visibility plays a role in increasing citation counts, it is not the sole factor influencing dataset citation impact. Other elements, such as dataset quality, research trends, and disciplinary norms, can also contribute to citation patterns.
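The domain-level indicators named in the abstract (h-index, mean, and median citations) can be illustrated with a minimal Python sketch. The citation counts and the two domains below are invented for illustration and are not taken from the study; the functions are one plausible way to compute these indicators, not the authors' code.

```python
from statistics import mean, median


def h_index(citations):
    """Largest h such that at least h datasets have >= h citations each."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            h = rank
        else:
            break
    return h


def domain_indicators(citations):
    """Summarise one domain's dataset citation counts (assumes a non-empty list)."""
    return {
        "h_index": h_index(citations),
        "mean": mean(citations),
        "median": median(citations),
    }


# Hypothetical citation counts for datasets hosted on two made-up domains.
# Both lists sum to 43, so their means are equal by construction.
skewed_domain = [40, 2, 1, 0, 0, 0]
even_domain = [8, 7, 7, 7, 7, 7]

for name, counts in [("skewed", skewed_domain), ("even", even_domain)]:
    print(name, domain_indicators(counts))
```

Both hypothetical domains have the same mean, yet their h-index and median differ sharply. That is one way the distribution of citations across datasets can make domain-level indicators disagree with each other.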

URL : Linking Data Citation to Repository Visibility: An Empirical Study

DOI : https://doi.org/10.48550/arXiv.2506.09530

Tags : Brigitte Mathiak, Fakhri Momeni, Janete Saldanha Bach, Linked Data, open repositories, Peter Mutschke, scientific communication

Semantic micro-contributions with decentralized nanopublication services

Post author : Hans Dillaerts
Post date : June 5, 2021

Authors : Tobias Kuhn, Ruben Taelman, Vincent Emonet, Haris Antonatos, Stian Soiland-Reyes, Michel Dumontier

While the publication of Linked Data has become increasingly common, the process tends to be a relatively complicated and heavy-weight one. Linked Data is typically published by centralized entities in the form of larger dataset releases, which has the downside that there is a central bottleneck in the form of the organization or individual responsible for the releases.

Moreover, certain kinds of data entries, in particular those with subjective or original content, currently do not fit into any existing dataset and are therefore more difficult to publish.

To address these problems, we present here an approach to use nanopublications and a decentralized network of services to allow users to directly publish small Linked Data statements through a simple and user-friendly interface, called Nanobench, powered by semantic templates that are themselves published as nanopublications.

The published nanopublications are cryptographically verifiable and can be queried through a redundant and decentralized network of services, based on the grlc API generator and a new quad extension of Triple Pattern Fragments.

We show here that these two kinds of services are complementary and together allow us to query nanopublications in a reliable and efficient manner. We also show that Nanobench indeed makes it very easy for users to publish Linked Data statements, even for those who have no prior experience in Linked Data publishing.
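To make the idea of querying by quad pattern more concrete, here is a minimal in-memory sketch in Python. It is not the API of the services described in the paper: the `Quad` type, the `match_quads` function, and all `ex:` identifiers are hypothetical. It only shows what extending a triple pattern with a fourth, graph position means, which matters for nanopublications because each one keeps its assertion and provenance in separate named graphs.

```python
from typing import NamedTuple


class Quad(NamedTuple):
    subject: str
    predicate: str
    obj: str
    graph: str


def match_quads(quads, subject=None, predicate=None, obj=None, graph=None):
    """Yield quads matching the pattern; None acts as a wildcard position."""
    pattern = (subject, predicate, obj, graph)
    for quad in quads:
        if all(want is None or want == got for want, got in zip(pattern, quad)):
            yield quad


# Hypothetical data: two tiny nanopublications, identified by their graph names.
QUADS = [
    Quad("ex:aspirin", "ex:treats", "ex:headache", "ex:np1#assertion"),
    Quad("ex:np1#assertion", "prov:wasAttributedTo", "ex:alice", "ex:np1#provenance"),
    Quad("ex:ibuprofen", "ex:treats", "ex:headache", "ex:np2#assertion"),
]

# A triple pattern alone matches across all graphs ...
print(list(match_quads(QUADS, predicate="ex:treats")))
# ... while the graph position narrows it to one nanopublication's assertion.
print(list(match_quads(QUADS, predicate="ex:treats", graph="ex:np2#assertion")))
```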

URL : Semantic micro-contributions with decentralized nanopublication services

DOI : https://doi.org/10.7717/peerj-cs.387

Tags : Haris Antonatos, Linked Data, Linked Open Data, Michel Dumontier, nanopublications, Ruben Taelman, scientific communication, semantic publishing, Semantic Web, Stian Soiland-Reyes, Tobias Kuhn, Vincent Emonet

Comparing the diffusion and adoption of linked data and research data management services among libraries

Post author : Hans Dillaerts
Post date : June 16, 2020

Author : Jinfang Niu

Introduction

Libraries face innovations periodically. It is important to identify consistent patterns in the diffusion and adoption of innovations so that libraries and relevant stakeholders will be informed and well-prepared for future innovations.

Method

This paper compares findings from two previous projects, which investigated the diffusion and adoption of two recent innovations: research data management services and linked data, respectively.

The two projects were conducted using similar methods: collecting and analysing literature about the adoption of these innovations in libraries in the United States. Literature was collected through Google Scholar searches, citation chasing, and targeted searches for people or libraries involved in their adoption.

Analysis

The gathered articles were then coded and analysed based on diffusion of innovation theories.

Results

Similarities and disparities between the diffusion and adoption of the two innovations were identified.

Conclusions

Findings from this study are informative for the decision-making of libraries, librarians, funders, and professional associations facing future innovations. They also contribute to diffusion of innovation theories through revealing new communication channels and alternative adoption processes, as well as redefining existing concepts.

URL : http://www.informationr.net/ir/25-2/paper855.html

Tags : academic libraries, Jinfang Niu, Linked Data, research data, research data management

Semantic publishing, la sémantique dans la sémiotique des codes sources d’écrits d’écran scientifiques

Post author : Hans Dillaerts
Post date : February 18, 2020

Author : Gérald Kembellec

This article analyses the issues at stake in semantic publishing in a scientific context and examines, from a semiotic perspective, the source code that serves as its vector of propagation.

The article presents and discusses the various "signes passeurs" (gateway signs) that make it possible to mesh fragmentary writing into a network: RDFa, microdata, and JSON-LD, for example. Their uses are analysed and related to the needs and goals of researchers, whether they are authors or readers.

Finally, the future of scientific semantic publishing is anticipated critically, and points of vigilance are raised concerning both the governance of the authorities and schemas that underpin linked data and the temptation to use and abuse the ancillary communicational benefits that lie between mediation and media exposure.
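As an illustration of one of the formats the abstract names, the Python sketch below builds a small JSON-LD description of this very article and wraps it in the script element commonly used to embed JSON-LD in a web page. The choice of the schema.org `ScholarlyArticle` type and of these four properties is an assumption made for the example; the article itself does not prescribe a vocabulary, and the helper names are hypothetical.

```python
import json


def scholarly_article_jsonld(title, author_name, url, language):
    """Describe an article with schema.org terms (vocabulary choice is an assumption)."""
    return {
        "@context": "https://schema.org",
        "@type": "ScholarlyArticle",
        "name": title,
        "author": {"@type": "Person", "name": author_name},
        "url": url,
        "inLanguage": language,
    }


def as_script_tag(data):
    """Serialise JSON-LD for embedding in an HTML page."""
    payload = json.dumps(data, ensure_ascii=False, indent=2)
    # Escape "</" so a value can never close the script element early;
    # "\/" is a valid JSON escape for "/", so the data round-trips unchanged.
    payload = payload.replace("</", "<\\/")
    return f'<script type="application/ld+json">\n{payload}\n</script>'


article = scholarly_article_jsonld(
    title=(
        "Semantic publishing, la sémantique dans la sémiotique des codes "
        "sources d’écrits d’écran scientifiques"
    ),
    author_name="Gérald Kembellec",
    url=(
        "https://lesenjeux.univ-grenoble-alpes.fr/2019/dossier/"
        "04-semantic-publishing-la-semantique-dans-la-semiotique-des-codes-"
        "sources-decrits-decran-scientifiques"
    ),
    language="fr",
)
print(as_script_tag(article))
```

The resulting block is invisible to a human reader of the page but readable by harvesters, which is the sense in which such markup links fragmentary writing into a network.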

URL : https://lesenjeux.univ-grenoble-alpes.fr/2019/dossier/04-semantic-publishing-la-semantique-dans-la-semiotique-des-codes-sources-decrits-decran-scientifiques

Tags : Gérald Kembellec, Linked Data, Scholarly Publishing, scientific communication, semantic publishing, Software Source Code

Towards Trusted Identities for Swiss Researchers and their Data

Post author : Hans Dillaerts
Post date : February 5, 2020

Authors : Julien A. Raemy, René Martin Schneider

In this paper we report on efforts to enhance the Swiss persistent identifier (PID) ecosystem. We first describe the current situation and the need for improvement, and then describe in full detail the steps undertaken to create a Swiss-wide model.

A case study was undertaken by using several data sets from the domains of art and design in the context of the ICOPAD project. We will provide a set of recommendations to enable a PID service that could mint Archival Resource Key (ARK) identifiers or a flavour of Research Resource Identifiers (RRIDs) as a complement to Digital Object Identifiers (DOIs).

We will conclude with some remarks concerning the transferability of this approach to other areas and the requirements for a national hub for PID management in Switzerland.

URL : Towards Trusted Identities for Swiss Researchers and their Data

DOI : https://doi.org/10.2218/ijdc.v14i1.596

Tags : Digital Object Identifiers, Julien A. Raemy, Linked Data, René Martin Schneider, Swiss persistent identifier, Switzerland

Linked Research on the Decentralised Web

Post author : Hans Dillaerts
Post date : November 29, 2019

Author : Sarven Capadisli

This thesis is about research communication in the context of the Web. I analyse literature which reveals how researchers are making use of Web technologies for knowledge dissemination, as well as how individuals are disempowered by the centralisation of certain systems, such as academic publishing platforms and social media.

I share my findings on the feasibility of a decentralised and interoperable information space where researchers can control their identifiers whilst fulfilling the core functions of scientific communication: registration, awareness, certification, and archiving.

The contemporary research communication paradigm operates under a diverse set of sociotechnical constraints, which influence how units of research information and personal data are created and exchanged.

Economic forces and non-interoperable system designs mean that researcher identifiers and research contributions are largely shaped and controlled by third-party entities; participation requires the use of proprietary systems.

From a technical standpoint, this thesis takes a deep look at the semantic structure of research artifacts, and how they can be stored, linked and shared in a way that is controlled by individual researchers, or delegated to trusted parties. Further, I find that the ecosystem was lacking a technical Web standard able to fulfill the awareness function of research communication.

Thus, I contribute a new communication protocol, Linked Data Notifications (published as a W3C Recommendation) which enables decentralised notifications on the Web, and provide implementations pertinent to the academic publishing use case. So far we have seen decentralised notifications applied in research dissemination or collaboration scenarios, as well as for archival activities and scientific experiments.
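The sender side of Linked Data Notifications can be sketched roughly as follows in Python: find the inbox that a resource advertises with the `http://www.w3.org/ns/ldp#inbox` link relation, then prepare an HTTP POST carrying a JSON-LD payload. The helper names, the `example.org` URLs, and the notification content are hypothetical, and the sketch only builds the request without sending it. The Link header parsing is deliberately simplified and is not a complete implementation of the protocol.

```python
import json
import re
from urllib.request import Request

LDP_INBOX = "http://www.w3.org/ns/ldp#inbox"


def find_inbox(link_header):
    """Return the inbox URL advertised in an HTTP Link header, or None.

    Simplified: assumes absolute URLs and no commas or semicolons
    inside quoted parameter values.
    """
    for target, params in re.findall(r"<([^>]*)>((?:\s*;\s*[^,;]+)*)", link_header):
        rel = re.search(r'rel\s*=\s*"([^"]*)"', params)
        if rel and LDP_INBOX in rel.group(1).split():
            return target
    return None


def build_notification_request(inbox_url, notification):
    """Prepare (but do not send) a POST delivering a JSON-LD notification."""
    body = json.dumps(notification).encode("utf-8")
    return Request(
        inbox_url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/ld+json"},
    )


# Hypothetical Link header returned for an article at example.org.
link_header = (
    '<https://example.org/article/meta>; rel="describedby", '
    '<https://example.org/inbox/>; rel="http://www.w3.org/ns/ldp#inbox"'
)

# Hypothetical notification: someone announces a review of the article.
notification = {
    "@context": "https://www.w3.org/ns/activitystreams",
    "type": "Announce",
    "actor": "https://example.org/people/alice",
    "object": "https://example.org/reviews/1",
    "target": "https://example.org/article",
}

inbox = find_inbox(link_header)
request = build_notification_request(inbox, notification)
print(request.get_method(), request.full_url)
```

Because the inbox is discovered from the resource itself rather than configured centrally, any application that understands the link relation can deliver notifications to it.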

Another core contribution of this work is a Web standards-based implementation of a client-side tool, dokieli, for decentralised article publishing, annotations and social interactions. dokieli can be used to fulfill the scholarly functions of registration, awareness, certification, and archiving, all in a decentralised manner, returning control of research contributions and discourse to individual researchers.

The overarching conclusion of the thesis is that Web technologies can be used to create a fully functioning ecosystem for research communication. Using the framework of Web architecture, and loosely coupling the four functions, an accessible and inclusive ecosystem can be realised whereby users are able to use and switch between interoperable applications without interfering with existing data.

Technical solutions alone do not suffice of course, so this thesis also takes into account the need for a change in the traditional mode of thinking amongst scholars, and presents the Linked Research initiative as an ongoing effort toward researcher autonomy in a social system, and universal access to human- and machine-readable information.

Outcomes of this outreach work so far include an increase in the number of individuals self-hosting their research artifacts, workshops publishing accessible proceedings on the Web, in-the-wild experiments with open and public peer-review, and semantic graphs of contributions to conference proceedings and journals (the Linked Open Research Cloud).

Some of the future challenges include: addressing the social implications of decentralised Web publishing, as well as the design of ethically grounded interoperable mechanisms; cultivating privacy aware information spaces; personal or community-controlled on-demand archiving services; and further design of decentralised applications that are aware of the core functions of scientific communication.

URL : https://csarven.ca/linked-research-decentralised-web

Tags : decentralized scholarly communication, Linked Data, research communication, Sarven Capadisli, scientific communication

Creating Structured Linked Data to Generate Scholarly Profiles: A Pilot Project using Wikidata and Scholia

Post author : Hans Dillaerts
Post date : December 13, 2018

Authors : Mairelys Lemus-Rojas, Jere D. Odell

INTRODUCTION

Wikidata, a knowledge base for structured linked data, provides an open platform for curating scholarly communication data. Because all elements in a Wikidata entry are linked to defining elements and metadata, other web systems can harvest and display the data in meaningful ways.

Thus, Wikidata has the capacity to serve as the data source for faculty profiles. Scholia is an example of how third-party tools can leverage the power of Wikidata to provide faculty profiles and bibliographic, data-driven visualizations.
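As a rough illustration of how an external system might harvest such data, the Python sketch below builds a SPARQL query for works whose author (Wikidata property P50) is a given item, plus the corresponding request URL for the public Wikidata Query Service. The function names are hypothetical and Q42 is used only as a well-known example item. This is not the query Scholia itself runs, and the sketch builds the URL without sending any request.

```python
from urllib.parse import urlencode

WDQS_ENDPOINT = "https://query.wikidata.org/sparql"


def works_by_author_query(author_qid, limit=10):
    """Build a SPARQL query listing works whose author (P50) is the given item."""
    if not (author_qid.startswith("Q") and author_qid[1:].isdigit()):
        raise ValueError(f"not a Wikidata item identifier: {author_qid!r}")
    return (
        "SELECT ?work ?workLabel WHERE {\n"
        f"  ?work wdt:P50 wd:{author_qid} .\n"
        '  SERVICE wikibase:label { bd:serviceParam wikibase:language "en" . }\n'
        "}\n"
        f"LIMIT {int(limit)}"
    )


def query_url(sparql):
    """URL that would ask the query service for JSON results."""
    return f"{WDQS_ENDPOINT}?{urlencode({'query': sparql, 'format': 'json'})}"


# Q42 (Douglas Adams) serves only as a familiar example item.
sparql = works_by_author_query("Q42", limit=5)
print(sparql)
print(query_url(sparql))
```

A profile page could run a handful of queries like this one (works, co-authors, venues) and render the results, without keeping a separate database of its own.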

DESCRIPTION OF PROGRAM

In this article, we share our methods for contributing to Wikidata and displaying the data with Scholia.

We deployed these methods as part of a pilot project in which we contributed data about a small but unique school on the Indiana University-Purdue University Indianapolis (IUPUI) campus, the IU Lilly Family School of Philanthropy.

NEXT STEPS

Following the completion of our pilot project, we aim to find additional methods for contributing large data collections to Wikidata. Specifically, we seek to contribute scholarly communication data that the library already maintains in other systems.

We are also facilitating Wikidata edit-a-thons to increase the library’s familiarity with the knowledge base and our capacity to contribute to the site.

URL : Creating Structured Linked Data to Generate Scholarly Profiles: A Pilot Project using Wikidata and Scholia

DOI : https://doi.org/10.7710/2162-3309.2272

Tags : Jere D. Odell, Linked Data, Mairelys Lemus-Rojas, open data, Wikidata