Researchers and their data: A study based on the use of the word data in scholarly articles

Authors : Frédérique Bordignon, Marion Maisonobe

Data is one of the most used terms in scientific vocabulary. This article focuses on the relationship between data and research by analyzing the contexts of occurrence of the word data in a corpus of 72,471 research articles (1980–2012) from two distinct fields (Social sciences, Physical sciences).

The aim is to shed light on the issues raised by research on data, namely the difficulty of defining what is considered as data, the transformations that data undergo during the research process, and how they gain value for researchers who hold them.

Relying on the distribution of occurrences throughout the texts and over time, it demonstrates that the word data mostly occurs at the beginning and end of research articles. Adjectives and verbs accompanying the noun data turn out to be even more important than data itself in specifying data.

The increase in the use of possessive pronouns at the end of the articles reveals that authors tend to claim ownership of their data at the very end of the research process. Our research demonstrates that even if data-handling operations are increasingly frequent, they are still described with imprecise verbs that do not reflect the complexity of these transformations.

URL : Researchers and their data: A study based on the use of the word data in scholarly articles

DOI : https://doi.org/10.1162/qss_a_00220

Over-promotion and caution in abstracts of preprints during the COVID-19 crisis

Authors : Frederique Bordignon, Liana Ermakova, Marianne Noel

The abstract is known to be a promotional genre where researchers tend to exaggerate the benefit of their research and use a promotional discourse to catch the reader’s attention. The COVID-19 pandemic has prompted intensive research and has changed traditional publishing with the massive adoption of preprints by researchers.

Our aim is to investigate whether the crisis and the ensuing scientific and economic competition have changed the lexical content of abstracts. We propose a comparative study of abstracts associated with preprints issued in response to the pandemic relative to abstracts produced during the closest pre-pandemic period.

We show that with the increase (on average and in percentage) of positive words (especially effective) and the slight decrease of negative words, there is a strong increase in hedge words (the most frequent of which are the modal verbs can and may).

Hedge words counterbalance the excessive use of positive words and thus invite the readers, who go probably beyond the ‘usual’ audience, to be cautious with the obtained results.

The abstracts of preprints urgently produced in response to the COVID-19 crisis stand between uncertainty and over-promotion, illustrating the balance that authors have to achieve between promoting their results and appealing for caution.

DOI : https://doi.org/10.1002/leap.1411

Preprint Abstracts in Times of Crisis: a Comparative Study with the Pre-pandemic Period

Authors : Frédérique Bordignon, Liana Ermakova, Marianne Noel

The urgency to respond to the COVID-19 outbreak has driven an unprecedented surge in preprints that aim to speed up knowledge dissemination as they are available much sooner than peer-reviewed publications.

In this study we consider abstracts of research articles and preprints as main entry points that draw attention to the most important information of the document and that try to entice us to read the whole article. In this paper, we try to capture and examine shifts in scientific abstract writing produced at the very beginning of the pandemic.

We made a comparative study of abstracts in terms of their informativeness associated with preprints issued in response to the COVID-19 pandemic and those produced in 2019, the closest pre-pandemic period. Our results clearly differ from one preprint server to another and show that there are community-centered habits as regards writing and reporting results.

The preprints issued from the arXiv, ChemRxiv and Research Square servers tend to have more informative (generous) abstracts than the ones submitted to the other servers. In four servers, the ratio of structured abstracts decreases with the pandemic.

URL : Preprint Abstracts in Times of Crisis: a Comparative Study with the Pre-pandemic Period

Original location : https://hal-enpc.archives-ouvertes.fr/hal-03187900

Impact de l’Open Access sur les citations : une étude de cas

Auteurs/Authors : Frédérique Bordignon, Mathieu Andro

De multiples études, dans la littérature internationale, ont cherché à évaluer l’impact de l’Open Access sur le taux de citation des articles scientifiques. La présente étude, en langue française, reste limitée aux publications 2010 de l’Ecole des Ponts.

Elle offre néanmoins un état de l’art des précédentes études sur le sujet à un lectorat de professionnels francophones et a pour originalité de mesurer le nombre moyen de citations par mois, avant et après “libération” Open Access des articles et d’éviter ainsi la plupart des biais qui peuvent être rencontrés dans ce type de démarche.

En plus de confirmer, comme beaucoup d’autres l’ont fait auparavant, un avantage net de l’Open Access sur le taux de citation en informatique, sciences de la terre et de l’univers, ingénierie, sciences environnementales, mathématiques, physique et astronomie, elle montre aussi qu’une « libération » précoce peut avoir un impact plus favorable qu’une « libération » tardive dans certains champs disciplinaires, comme les mathématiques et physique/astronomie.

URL : Impact de l’Open Access sur les citations : une étude de cas