A Scientific Knowledge Graph with Community Detection and Routes of Search. Testing “GRAPHYP” as a Toolkit for Resilient Upgrade of Scholarly Content

Authors : Renaud Fabre, Otmane Azeroual, Patrice Bellot, Joachim Schöpfel, Daniel Egret

Unlimited change in scientific terminology challenges integrity in scientific knowledge graph (SKG) representation, while current data and modeling standards, mostly document oriented, hardly allow a resilient semantic upgrade of scholarly content.

Moreover, results of a “multimodal knowledge acquisition” are required for an efficient upgrade of search methods: “vital nodes” differ among users of the same keyword, due to the distinct needs of scientific communities, rooted in their own interpretations and controversies.

Modeling and data are challenged to propose new outcomes that mix automated information with human choices and allow dynamic community detection. To fulfill this program with the GRAPHYP toolkit, we identify a workflow that ensures the integrity and completeness of search management activities.

It encompasses data standards for “routes” of search, modeling of community detection and navigation inside the SK bipartite hypergraph, and a first test with extraction of characteristics of communities’ preferences from readings of scholarly content.

“Search is not Research” and therefore further work should explore the links between modeling and data recording research contents and “search and select” results in SKG data structure.

Original location : https://hal.archives-ouvertes.fr/hal-03365118

Evaluating the scientific impact of research infrastructures: The role of current research information systems

Authors : Renaud Fabre, Daniel Egret, Joachim Schöpfel, Otmane Azeroual

Research infrastructures (RI) offer researchers a multitude of research opportunities and services and play a key role in the performance, innovative strength, and international competitiveness of science. As an important part of the generation and use of new knowledge and technologies, they are essential for research policies.

Because of their strategic importance and their need for significant funding, there is a growing demand for the assessment of their scientific output and impact. Current research information systems (CRIS) have contributed for many years now to the evaluation of universities and research organizations.

Based on studies on the application of CRIS to infrastructures and on a recent French report on the scientometric assessment of RI, this paper analyzes the potential of CRIS and their data models and standards (in particular the international CERIF format and the German RDC model) for the monitoring and evaluation of RI.

The interaction between functional specificities of RI and standards for their assessment is outlined, with reference to their own potential to stimulate and share innovation in the networks located inside and outside RI.

This societal challenge, more than an academic issue, is on the way to further harmonization and consolidation of shared and common RI metrics.

DOI : https://doi.org/10.1162/qss_a_00111

Open Bioeconomy—A Bibliometric Study on the Accessibility of Articles in the Field of Bioeconomy

Authors : Marianne Duquenne, Hélène Prost, Joachim Schöpfel, Franck Dumeignil

Open access (OA) to scientific information is one of the major challenges and objectives of current public research policy. The purpose of this paper is to assess the degree of openness of scientific articles on bioeconomy, one of the emergent research fields at the crossroads of several disciplines, with high societal and industrial impact.

Based on a Web of Science (WoS) corpus of 2489 articles published between 2015 and 2019, we calculated bibliometric indicators, explored the openness of each article and assessed the share of journals, countries and research areas of these articles.

The results show a sharp increase and diversification of articles in the field of bioeconomy, with an emerging long-tail distribution. 45.6% of the articles are freely available and the share of OA articles is steadily increasing, from 31% in 2015 to 52% in 2019.

Gold is the most important variant of OA. Open access is low in the applied research areas of chemical, agricultural and environmental engineering but higher in the domains of energy and fuels, forestry and green and sustainable science and technology.

The UK and the Netherlands have the highest rates of OA articles, followed by Spain and Germany. The funding rate of OA articles is higher than that of non-OA articles. This is the first bibliometric study on open access to articles on bioeconomy.

The results can be useful for the further development of OA editorial and funding criteria in the field of bioeconomy.

DOI : https://doi.org/10.3390/publications8040055

Research Ethics, Open Science and CRIS

Authors : Joachim Schöpfel, Otmane Azeroual, Monika Jungbauer-Gans

The purpose of this paper is to analyze how current research information systems (CRIS) take into account ethical issues, especially in the environment of open science. The analysis is based on a review of the literature on research information management, CRIS, open science and research ethics.

The paper provides a framework for the assessment of CRIS on two levels: are CRIS (= their data model, format, functionalities, etc.) compliant with ethical requirements from the research community, funding bodies, government, etc., i.e., can they appropriately process data on research ethics (protocols, misconduct, etc.), and which are the ethical issues of the development, implementation and usage of CRIS?

What is the impact of new ethical requirements from the open science movement, such as integrity or transparency? Can CRIS be considered as ethical infrastructures or “infraethics”?

Concluding this analysis, the paper proposes an empirical approach for further investigation of this topic. The originality of the paper is that there are very few studies so far that assess the implications of research ethics and open science on the CRIS.

URL : https://hal.univ-lille.fr/hal-03034276

Penser local. Développer une politique de données sur un campus SHS (Think local. Developing a data policy on a social sciences and humanities campus)

Author : Joachim Schöpfel

As part of the French National Plan for Open Science, the structuring and sharing of research data are now among the priorities of France's science policy.

Every institution and research organization must adopt an open science policy and set up a range of services and facilities for research data management.

Drawing on field surveys, the article proposes a roadmap for implementing such a policy on a university campus in the social sciences and humanities.

Ten principles suggest directions for the governance and steering of this policy, for setting development and investment priorities, and for linking up with research infrastructures, notably Huma-Num.

This is a bottom-up approach that focuses on researchers' practices and needs and places researchers at the heart of an institutional policy on research data.

URL : https://www.openscience.fr/Penser-local

L’éthique des données de la recherche en sciences humaines et sociales. Une introduction (The ethics of research data in the social sciences and humanities. An introduction)

Authors : Bernard Jacquemin, Joachim Schöpfel, Stéphane Chaudiron, Eric Kergosien

Organizing open access to scientific data is one of the objectives of public research in France. The commitment to opening research data was confirmed by the 2018-2020 national action plan, whose commitment 18 aims to build an open science ecosystem.

On the ground, the policy of openness comes with strong incentives to adopt good scientific practices compatible with certain principles defined at the European level, such as the “FAIR Guiding Principles” for research data management and stewardship. What is the ethical dimension of “FAIR” research data management?

Drawing on a selection of recent publications, surveys, studies and activities concerning research data, our paper attempts to synthesize several aspects of the ethical dimension of research data management in the French context, including the place of ethics in data management plans, personal data, credibility, and data security.

URL : https://hal.archives-ouvertes.fr/GERIICO/hal-01958472v1

Data papers as a new form of knowledge organization in the field of research data

Authors : Joachim Schöpfel, Dominic Farace, Hélène Prost, Antonella Zane

Data papers have been defined as scholarly journal publications whose primary purpose is to describe research data. Our survey provides more insights about the environment of data papers, i.e. disciplines, publishers and business models, and about their structure, length, formats, metadata and licensing.

Data papers are a product of the emerging ecosystem of data-driven open science. They contribute to the FAIR principles for research data management. However, the boundaries with other categories of academic publishing are partly blurred. Data papers are (can be) generated automatically and are potentially machine-readable.

Data papers are essentially information, i.e. descriptions of data, but they also partly contribute to the generation of knowledge and data in their own right. As part of the new ecosystem of open and data-driven science, data papers and data journals are an interesting and relevant object for the assessment and understanding of the transition of the former system of academic publishing.

URL : https://halshs.archives-ouvertes.fr/ISKOFRANCE2019/halshs-02284548