Caching and Reproducibility: Making Data Science Experiments Faster and FAIRer

Authors : Moritz Schubotz, Ankit Satpute, André Greiner-Petter, Akiko Aizawa, Bela Gipp

Small to medium-scale data science experiments often rely on research software developed ad-hoc by individual scientists or small teams. Often there is no time to make the research software fast, reusable, and open access.

The consequence is twofold. First, subsequent researchers must spend significant work hours building upon the proposed hypotheses or experimental framework. In the worst case, others cannot reproduce the experiment and reuse the findings for subsequent research. Second, suppose the ad-hoc research software fails during often long-running computational expensive experiments.

In that case, the overall effort to iteratively improve the software and rerun the experiments creates significant time pressure on the researchers. We suggest making caching an integral part of the research software development process, even before the first line of code is written.

This article outlines caching recommendations for developing research software in data science projects. Our recommendations provide a perspective to circumvent common problems such as propriety dependence, speed, etc. At the same time, caching contributes to the reproducibility of experiments in the open science workflow.

Concerning the four guiding principles, i.e., Findability, Accessibility, Interoperability, and Reusability (FAIR), we foresee that including the proposed recommendation in a research software development will make the data related to that software FAIRer for both machines and humans.

We exhibit the usefulness of some of the proposed recommendations on our recently completed research software project in mathematical information retrieval.

URL : Caching and Reproducibility: Making Data Science Experiments Faster and FAIRer

DOI : https://doi.org/10.3389/frma.2022.861944

Academic vs. biological age in research on academic careers: a large-scale study with implications for scientifically developing systems

Authors : Marek Kwiek, Wojciech Roszka

Biological age is an important sociodemographic factor in studies on academic careers (research productivity, scholarly impact, and collaboration patterns). It is assumed that the academic age, or the time elapsed from the first publication, is a good proxy for biological age.

In this study, we analyze the limitations of the proxy in academic career studies, using as an example the entire population of Polish academic scientists and scholars visible in the last decade in global science and holding at least a PhD (N = 20,569). The proxy works well for science, technology, engineering, mathematics, and medicine (STEMM) disciplines; however, for non-STEMM disciplines (particularly for humanities and social sciences), it has a dramatically worse performance.

This negative conclusion is particularly important for systems that have only recently visible in global academic journals. The micro-level data suggest a delayed participation of social scientists and humanists in global science networks, with practical implications for predicting biological age from academic age.

We calculate correlation coefficients, present contingency analysis of academic career stages with academic positions and age groups, and create a linear multivariate regression model.

Our research suggests that in scientifically developing countries, academic age as a proxy for biological age should be used more cautiously than in advanced countries: ideally, it should be used only for STEMM disciplines.

URL : Academic vs. biological age in research on academic careers: a large-scale study with implications for scientifically developing systems

DOI : https://doi.org/10.1007/s11192-022-04363-0

Institutionalizing Open Science in Africa: Limitations and Prospects

Authors : Izuchukwu Azuka Okafor, Smart Ikechukwu Mbagwu, Terkuma Chia, Zuwati Hasim, Echezona Ejike Udokanma, Karthik Chandran

The advancement of scientific research and raising the next-generation scientists in Africa depend largely on science access. The COVID-19 pandemic has caused discussions around open science (OS) to reemerge globally, especially in resource-poor settings like Africa, where the practice of OS is low.

The authors highlighted the elements, benefits, and existing initiatives of OS in Africa. More importantly, the article critically appraised the challenges, opportunities, and future considerations of OS in Africa. Addressing challenges of funding and leadership at different levels of educational, research, and government parastatals may be pivotal in charting a new course for OS in Africa.

This review serves as an advocacy strategy and an informative guide to policymaking and institutionalization of OS in Africa.

URL : Institutionalizing Open Science in Africa: Limitations and Prospects

DOI : https://doi.org/10.3389/frma.2022.855198

Advancing Self-Evaluative and Self-Regulatory Mechanisms of Scholarly Journals: Editors’ Perspectives on What Needs to Be Improved in the Editorial Process

Author : Katarina Krapež

Meticulous self-evaluative practices in the offices of academic periodicals can be helpful in reducing widespread uncertainty about the quality of scholarly journals. This paper summarizes the results of the second part of a qualitative worldwide study among 258 senior editors of scholarly journals across disciplines.

By means of a qualitative questionnaire, the survey investigated respondents’ perceptions of needed changes in their own editorial workflow that could, according to their beliefs, positively affect the quality of their journals.

The results show that the most relevant past improvements indicated by respondents were achieved by: (a) raising the required quality criteria for manuscripts, by defining standards for desk rejection and/or shaping the desired qualities of the published material, and (b) guaranteeing a rigorous peer review process.

Respondents believed that, currently, three areas have the most pressing need for amendment: ensuring higher overall quality of published articles (26% of respondents qualified this need as very high or high), increasing the overall quality of peer-review reports (23%), and raising reviewers’ awareness of the required quality standards (20%).

Bivariate analysis shows that respondents who work with non-commercial publishers reported an overall greater need to improve implemented quality assessment processes. Work overload, inadequate reward systems, and a lack of time for development activities were cited by respondents as the greatest obstacles to implementing necessary amendments.

URL : Advancing Self-Evaluative and Self-Regulatory Mechanisms of Scholarly Journals: Editors’ Perspectives on What Needs to Be Improved in the Editorial Process

DOI : https://doi.org/10.3390/publications10010012

Data Services Librarians’ Responsibilities and Perspectives on Research Data Management

Authors : Bradley Wade Bishop, Ashley M. Orehek, Christopher Eaker, Plato L. Smith

This study of data services librarians is part of a series of studies examining the current roles and perspectives on Research Data Management (RDM) services in higher education. Reviewing current best practices provides insights into the role-based responsibilities for RDM services that data services librarians perform, as well as ways to improve and create new services to meet the needs of their respective university communities.

Objectives

The objectives of this article are to provide the context of research data services through a review of past studies, explain how they informed this qualitative study, and provide the methods and results of the current study.

This study provides an in-depth overview of the overall job responsibilities of data services librarians and as well as their perspectives on RDM through job analyses.

Methods

Job analysis interviews provide insight and context to the tasks employees do as described in their own words. Interviews with 10 data services librarians recruited from the top 10 public and top 10 private universities according to the 2020 Best National University Rankings in the US News and World Reports were asked 30 questions concerning their overall job tasks and perspectives on RDM.

Five public and five private data services librarians were interviewed. The interviews were recorded and transcribed. The transcriptions were analyzed in NVivo using a grounded theory application of open, axial, and selective coding to generate categories and broad themes based on the responses using synonymous meanings.

Results: The results presented here provide the typical job tasks of data services librarians that include locating secondary data, reviewing data management plans (DMPs), conducting outreach, collaborating, and offering RDM training. Fewer data services librarians assisted with data curation or manage an institutional repository.

Discussion

The results indicate that there may be different types of data services librarians depending on the mix of responsibilities. Academic librarianship will benefit from further delineation of job titles using tasks while planning, advertising, hiring, and evaluating workers in this emerging area. There remain many other explorations needed to understand the challenges and opportunities for data services librarians related to RDM.

Conclusions

This article concludes with a proposed matrix of job tasks that indicates different types of data services librarians to inform further study. Future job descriptions, training, and education will all benefit from differentiating between the many associated research data services roles and with increased focus on research data greater specializations will emerge.

URL : Data Services Librarians’ Responsibilities and Perspectives on Research Data Management

DOI : https://doi.org/10.7191/jeslib.2022.1226

Perspectives on Gender in Science, Technology, and Innovation: A Review of Sub-Saharan Africa’s Science Granting Councils and Achieving the Sustainable Development Goals

Authors : Jose C. Jackson, Jane G. Payumo, Amy J. Jamison, Michael L. Conteh, Petronella Chirawu

Africa’s focus on science, technology, and innovation (STI) has grown over the last decade, with emerging examples of good practice. There are however numerous challenges to sustainable development in Africa; for example, inequalities within and among African countries are rising and enormous disparities of opportunity, wealth, and power persist.

While policy makers and organizations have put increasing emphasis on integrating gender into STI policies and initiatives as a means to achieve gender equality for all women and girls, inequality remains a key challenge to continental sustainable development. STI funders such as the Science Granting Councils (SGCs) in Africa are key players in national innovation systems.

They advise and facilitate policy and program development, disburse funds, build research capacity, set and monitor research agendas, manage bilateral and multilateral STI agreements, and assess the communication, uptake, and impact of research.

They, therefore, have a major role to play in enabling countries to achieve SDG5. This study assessed the current actions in gender mainstreaming across the SGCs and the status of gender research and collaboration in participating countries.

Our findings provide evidence of uneven progress in promoting gender equality in the operations of the SGCs, including funding research and promoting the integration of gender dimensions in research content and curricula.

All SGCs emphasized national commitments to gender, and the importance of gender in STI, but acknowledged that at the structural and institutional levels there was a misalignment between policy and practice. As expected, more men than women were employed across most levels at the SGCs and held positions of seniority and decision making.

Most of the SGCs had very limited or no gender-related funding programs to promote gender and STI or to eliminate the barriers that women scholars face.

This resulted in persistent inequalities in who received funding, the size of the grants they received, and in the knowledge production, collaboration, and the impact on their country’s gender-related research. These findings suggest that SGCs need to strengthen their actions to mainstream gender if they are to achieve success with SDG5.

URL : Perspectives on Gender in Science, Technology, and Innovation: A Review of Sub-Saharan Africa’s Science Granting Councils and Achieving the Sustainable Development Goals

DOI : https://doi.org/10.3389/frma.2022.814600

Peer reviewers equally critique theory, method, and writing, with limited effect on the final content of accepted manuscripts

Author : Dimity Stephen

The primary aims of peer review are to detect flaws and deficiencies in the design and interpretation of studies, and ensure the clarity and quality of their presentation. However, it has been questioned whether peer review fulfils this function.

Studies have highlighted a stronger focus of reviewers on critiquing methodological aspects of studies and the quality of writing in biomedical sciences, with less focus on theoretical grounding. In contrast, reviewers in the social sciences appear more concerned with theoretical underpinnings.

These studies also found the effect of peer review on manuscripts’ content to be variable, but generally modest and positive. I qualitatively analysed 1430 peer reviewers’ comments for a sample of 40 social science preprint-publication pairs to identify the key foci of reviewers’ comments.

I then quantified the effect of peer review on manuscripts by examining differences between the preprint and published versions using the normalised Levenshtein distance, cosine similarity, and word count ratios for titles, abstracts, document sections and full-texts.

I also examined changes in references used between versions and linked changes to reviewers’ comments. Reviewers’ comments were nearly equally split between issues of methodology (30.7%), theory (30.0%), and writing quality (29.2%).

Titles, abstracts, and the semantic content of documents remained similar, although publications were typically longer than preprints.

Two-thirds of citations were unchanged, 20.9% were added during review and 13.1% were removed. These findings indicate reviewers equally attended to the theoretical and methodological details and communication style of manuscripts, although the effect on quantitative measures of the manuscripts was limited.

URL : Peer reviewers equally critique theory, method, and writing, with limited effect on the final content of accepted manuscripts

DOI : https://doi.org/10.1007/s11192-022-04357-y