Google Scholar as a data source for research assessment

Authors : Emilio Delgado López-Cózar, Enrique Orduna-Malea, Alberto Martín-Martín

The launch of Google Scholar (GS) marked the beginning of a revolution in the scientific information market. This search engine, unlike traditional databases, automatically indexes information from the academic web. Its ease of use, together with its wide coverage and fast indexing speed, have made it the first tool most scientists currently turn to when they need to carry out a literature search.

Additionally, the fact that its search results were accompanied from the beginning by citation counts, as well as the later development of secondary products which leverage this citation data (such as Google Scholar Metrics and Google Scholar Citations), made many scientists wonder about its potential as a source of data for bibliometric analyses.

The goal of this chapter is to lay the foundations for the use of GS as a supplementary source (and in some disciplines, arguably the best alternative) for scientific evaluation.

First, we present a general overview of how GS works. Second, we present empirical evidences about its main characteristics (size, coverage, and growth rate). Third, we carry out a systematic analysis of the main limitations this search engine presents as a tool for the evaluation of scientific performance.

Lastly, we discuss the main differences between GS and other more traditional bibliographic databases in light of the correlations found between their citation data. We conclude that Google Scholar presents a broader view of the academic world because it has brought to light a great amount of sources that were not previously visible.

URL : https://arxiv.org/abs/1806.04435

Collaboration Diversity and Scientific Impact

Authors : Yuxiao Dong, Hao Ma, Jie Tang, Kuansan Wang

The shift from individual effort to collaborative output has benefited science, with scientific work pursued collaboratively having increasingly led to more highly impactful research than that pursued individually.

However, understanding of how the diversity of a collaborative team influences the production of knowledge and innovation is sorely lacking. Here, we study this question by breaking down the process of scientific collaboration of 32.9 million papers over the last five decades.

We find that the probability of producing a top-cited publication increases as a function of the diversity of a team of collaborators—namely, the distinct number of institutions represented by the team.

We discover striking phenomena where a smaller, yet more diverse team is more likely to generate highly innovative work than a relatively larger team within one institution.

We demonstrate that the synergy of collaboration diversity is universal across different generations, research fields, and tiers of institutions and individual authors.

Our findings suggest that collaboration diversity strongly and positively correlates with the production of scientific innovation, giving rise to the potential revolution of the policies used by funding agencies and authorities to fund research projects, and broadly the principles used to organize teams, organizations, and societies.

URL : https://arxiv.org/abs/1806.03694

Zombie Journals: Designing a Technological Infrastructure for a Precarious Journal

Authors : Daniel Paul O’Donnell, Carey Viejou, Sylvia Chow, Rumi Graham, Jarret McKinnon, Dorothea Morrison, Reed Parsons, Courtney Rieger, Vanja Spirić, Elaine Toth

Background

The Meeting of the Minds graduate student journal is edited primarily by students from our Masters programme. This means that our editorial board is subject to high annual turnover and that our technological infrastructure and workflow needed to be easy to train for, accommodate differing levels of technological skill and editorial interest, and provide archiving that did not require a continuing interest in the journal by future generations of students.

Analysis

This article provides a detailed and comparative account of the “off-the-shelf ” systems and software used in developing the journal with an explanation of the rationale behind our choices.

Conclusion and implications

The choices we made can be adopted by other journals interested in a low-cost, “future-proof ” approach to developing a publishing infrastructure.

URL : Zombie Journals: Designing a Technological Infrastructure for a Precarious Journal

DOI : https://src-online.ca/index.php/src/article/view/296

Analysis of Peer Review Effectiveness for Academic Journals Based on Distributed Parallel System

Authors : Zong-Yuan Tan, Ning Cai, Jian Zhou

A simulation model based on parallel systems is established, aiming to explore the relation between the number of submissions and the overall quality of academic journals within a similar discipline under peer review.

The model can effectively simulate the submission, review and acceptance behaviors of academic journals, in a distributed manner. According to the simulation experiments, it could possibly happen that the overall standard of academic journals may deteriorate due to excessive submissions.

URL : https://arxiv.org/abs/1806.00287

Creativity in Science and the Link to Cited References: Is the Creative Potential of Papers Reflected in their Cited References?

Authors : Iman Tahamtan, Lutz Bornmann

Several authors have proposed that a large number of unusual combinations of cited references in a paper point to its high creative potential (or novelty). However, it is still not clear whether the number of unusual combinations can really measure the creative potential of papers.

The current study addresses this question on the basis of several case studies from the field of scientometrics. We identified some landmark papers in this field. Study subjects were the corresponding authors of these papers.

We asked them where the ideas for the papers came from and which role the cited publications played. The results revealed that the creative ideas might not necessarily have been inspired by past publications.

The literature seems to be important for the contextualization of the idea in the field of scientometrics. Instead, we found that creative ideas are the result of finding solutions to practical problems, result from discussions with colleagues, and profit from interdisciplinary exchange. The roots of the studied landmark papers are discussed in detail.

URL : https://arxiv.org/abs/1806.00224

Open and transparent research practices and public perceptions of the trustworthiness of agricultural biotechnology organizations

Authors : Asheley R. Landrum, Joseph Hilgard, Robert B. Lull, Heather Akin, Kathleen Hall Jamieson

Public trust in agricultural biotechnology organizations that produce so-called ‘genetically-modified organisms’ (GMOs) is affected by misinformed attacks on GM technology and worry that producers’ concern for profits overrides concern for the public good.

In an experiment, we found that reporting that the industry engages in open and transparent research practices increased the perceived trustworthiness of university and corporate organizations involved with GMOs.

Universities were considered more trustworthy than corporations overall, supporting prior findings in other technology domains.

The results suggest that commitment to, and communication of, open and transparent research practices should be part of the process of implementing agricultural biotechnologies.

URL : Open and transparent research practices and public perceptions of the trustworthiness of agricultural biotechnology organizations

DOI : https://doi.org/10.22323/2.17020204

Conceptualizing Data Curation Activities Within Two Academic Libraries

Authors : Sophia Lafferty-Hess, Julie Rudder, Moira Downey, Susan Ivey, Jennifer Darragh

A growing focus on sharing research data that meet certain standards, such as the FAIR guiding principles, has resulted in libraries increasingly developing and scaling up support for research data.

As libraries consider what new data curation services they would like to provide as part of their repository programs, there are various questions that arise surrounding scalability, resource allocation, requisite expertise, and how to communicate these services to the research community.

Data curation can involve a variety of tasks and activities. Some of these activities can be managed by systems, some require human intervention, and some require highly specialized domain or data type expertise.

At the 2017 Triangle Research Libraries Network Institute, staff from the University of North Carolina at Chapel Hill and Duke University used the 47 data curation activities identified by the Data Curation Network project to create conceptual groupings of data curation activities.

The results of this “thought-exercise” are discussed in this white paper. The purpose of this exercise was to provide more specificity around data curation within our individual contexts as a method to consistently discuss our current service models, identify gaps we would like to fill, and determine what is currently out of scope.

We hope to foster an open and productive discussion throughout the larger academic library community about how we prioritize data curation activities as we face growing demand and limited resources.

URL : Conceptualizing Data Curation Activities Within Two Academic Libraries

DOI : https://dx.doi.org/10.17605/OSF.IO/ZJ5PQ