# A Century of Science: Globalization of Scientific Collaborations, Citations, and Innovations

Authors : Yuxiao Dong, Hao Ma, Zhihong Shen, Kuansan Wang

Progress in science has advanced the development of human society across history, with dramatic revolutions shaped by information theory, genetic cloning, and artificial intelligence, among the many scientific achievements produced in the 20th century. However, the way that science itself advances is much less well understood.

In this work, we study the evolution of scientific development over the past century by presenting an anatomy of 89 million digitalized papers published between 1900 and 2015.

We find that science has benefited from the shift from individual work to collaborative effort, with collaborations generating over 90% of the world-leading innovations in this century, a share nearly four times higher than in the 1900s.

We discover that rather than the frequent myopic- and self-referencing that was common in the early 20th century, modern scientists instead tend to look for literature further back and farther around.

Finally, we also observe the globalization of scientific development from 1900 to 2015, including 25-fold and 7-fold increases in international collaborations and citations, respectively, as well as a dramatic decline in the dominant accumulation of citations by the US, the UK, and Germany, from 95% to 50% over the same period.

Our discoveries are meant to serve as a starter for exploring the visionary ways in which science has developed throughout the past century, generating insight into and an impact upon the current scientific innovations and funding policies.

# Should biomedical research be like Airbnb?

Authors : Vivien R. Bonazzi, Philip E. Bourne

The thesis presented here is that biomedical research is based on the trusted exchange of services. That exchange would be conducted more efficiently if the trusted software platforms to exchange those services, if they exist, were more integrated.

While simpler and narrower in scope than the services governing biomedical research, comparison to existing internet-based platforms, like Airbnb, can be informative.

We illustrate how the analogy to internet-based platforms works and does not work and introduce The Commons, under active development at the National Institutes of Health (NIH) and elsewhere, as an example of the move towards platforms for research.

# An Interactive Map for Showcasing Repository Impacts

Authors : Hui Zhang, Camden Lopez

Digital repository managers rely on usage metrics such as the number of downloads to demonstrate research visibility and impacts of the repositories. Increasingly, they find that current tools such as spreadsheets and charts are ineffective for revealing important elements of usage, including reader locations, and for attracting the targeted audiences.

This article describes the design and development of a readership map that provides an interactive, near-real-time visualization of actual visits to an institutional repository using data from Google Analytics.

The readership map exhibits the global impacts of a repository by displaying the city of every view or download together with the title of the scholarship being read and a hyperlink to its page in the repository.

We will discuss project motivation and development issues such as authentication with Google API, metadata integration, performance tuning, and data privacy.
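As a rough illustration of the data shape such a readership map needs, the sketch below turns visit records into GeoJSON points carrying the city, title, and hyperlink mentioned above. It is a minimal sketch under stated assumptions, not the authors' implementation: the record fields (`city`, `lat`, `lon`, `title`, `url`), the function name `visits_to_geojson`, and the sample values are all hypothetical, and retrieving the records from Google Analytics is deliberately left out.

```python
import json


def visits_to_geojson(visits):
    """Convert visit records into a GeoJSON FeatureCollection of map points.

    Each record is assumed (hypothetically) to carry a city name, its
    coordinates, the title of the item viewed, and a link to its page.
    """
    features = []
    for visit in visits:
        features.append({
            "type": "Feature",
            # GeoJSON orders coordinates as [longitude, latitude].
            "geometry": {
                "type": "Point",
                "coordinates": [visit["lon"], visit["lat"]],
            },
            "properties": {
                "city": visit["city"],
                "title": visit["title"],
                "url": visit["url"],
            },
        })
    return {"type": "FeatureCollection", "features": features}


# Hypothetical sample records; real data would come from the analytics service.
sample_visits = [
    {"city": "Portland", "lat": 45.52, "lon": -122.68,
     "title": "Example thesis", "url": "https://example.org/items/1"},
    {"city": "Nairobi", "lat": -1.29, "lon": 36.82,
     "title": "Example dataset", "url": "https://example.org/items/2"},
]

collection = visits_to_geojson(sample_visits)
print(json.dumps(collection, indent=2))
```

A structure like this can be handed to most web mapping libraries, which is one reason to keep the city, title, and link together on each point rather than in separate tables.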

# Linked Data is People: Building a Knowledge Graph to Reshape the Library Staff Directory

Authors : Jason A. Clark, Scott W. H. Young

One of our greatest library resources is people. Most libraries have staff directory information published on the web, yet most of this data is trapped in local silos, PDFs, or unstructured HTML markup.

With this in mind, the library informatics team at Montana State University (MSU) Library set a goal of remaking our people pages by connecting the local staff database to the Linked Open Data (LOD) cloud.

In pursuing linked data integration for library staff profiles, we have realized two primary use cases: improving the search engine optimization (SEO) for people pages and creating network graph visualizations.

In this article, we will focus on the code to build this library graph model as well as the linked data workflows and ontology expressions developed to support it. Existing linked data work has largely centered around machine-actionable data and improvements for bots or intelligent software agents.

Our work demonstrates that connecting your staff directory to the LOD cloud can reveal relationships among people in dynamic ways, thereby raising staff visibility and bringing an increased level of understanding and collaboration potential for one of our primary assets: the people that make the library happen.
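To make the SEO use case above more concrete, the following sketch builds a JSON-LD description of a staff member that could be embedded in a people page. It assumes the schema.org vocabulary (`Person`, `jobTitle`, `worksFor`, `sameAs`), which is one common choice and is not confirmed here as the ontology the authors used; the function name `staff_to_jsonld`, the record fields, and the sample person are hypothetical.

```python
import json


def staff_to_jsonld(staff):
    """Describe one staff record as a schema.org Person in JSON-LD.

    The input field names ("name", "title", "library", "same_as") are
    hypothetical stand-ins for columns in a local staff database.
    """
    doc = {
        "@context": "https://schema.org",
        "@type": "Person",
        "name": staff["name"],
        "jobTitle": staff["title"],
        "worksFor": {"@type": "Library", "name": staff["library"]},
    }
    # sameAs links the local record to identifiers elsewhere in the LOD cloud.
    if staff.get("same_as"):
        doc["sameAs"] = staff["same_as"]
    return doc


# Hypothetical record; not a real person or identifier.
record = {
    "name": "Jane Example",
    "title": "Digital Initiatives Librarian",
    "library": "Example University Library",
    "same_as": ["https://example.org/id/jane-example"],
}

person = staff_to_jsonld(record)
print(json.dumps(person, indent=2))
```

The printed object would typically be placed in a `<script type="application/ld+json">` element on the profile page so that search engines can read it alongside the human-facing HTML.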

# Anticipated effects of an open access policy at a private foundation

Authors : Eesha Khare, Carly Strasser

The Gordon and Betty Moore Foundation (GBMF) was interested in understanding the potential effects of a policy requiring open access to peer-reviewed publications resulting from the research the foundation funds.

To explore this question, we collected data on more than 2000 publications in over 500 journals that were generated by GBMF grantees since 2001. We then examined the journal policies to establish how two possible open access policies might have affected grantee publishing habits.

We found that 99.3% of the articles published by grantees would have complied with a policy that requires open access within 12 months of publication. We also estimated the cost to GBMF of covering fees associated with “gold open access” to be between $250,000 and $2,500,000 annually.

# Open Science: What, Why, and How

Authors : Barbara A. Spellman, Elizabeth A. Gilbert, Katherine S. Corker

Open Science is a collection of actions designed to make scientific processes more transparent and results more accessible. Its goal is to build a more replicable and robust science; it does so using new technologies, altering incentives, and changing attitudes.

The current movement towards open science was spurred, in part, by a recent “series of unfortunate events” within psychology and other sciences.

These events include the large number of studies that have failed to replicate and the prevalence of common research and publication procedures that could explain why.

Many journals and funding agencies now encourage, require, or reward some open science practices, including pre-registration, providing full materials, posting data, distinguishing between exploratory and confirmatory analyses, and running replication studies.

Individuals can practice and encourage open science in their many roles as researchers, authors, reviewers, editors, teachers, and members of hiring, tenure, promotion, and awards committees.

A plethora of resources are available to help scientists, and science, achieve these goals.

# Perseids: Experimenting with Infrastructure for Creating and Sharing Research Data in the Digital Humanities

Author : Bridget Almas

The Perseids project provides a platform for creating, publishing, and sharing research data, in the form of textual transcriptions, annotations and analyses.

An offshoot and collaborator of the Perseus Digital Library (PDL), Perseids is also an experiment in reusing and extending existing infrastructure, tools, and services.

This paper discusses infrastructure in the domain of digital humanities (DH). It outlines some general approaches to facilitating data sharing in this domain, and the specific choices we made in developing Perseids to serve that goal.

It concludes by identifying lessons we have learned about sustainability in the process of building Perseids, noting some critical gaps in infrastructure for the digital humanities, and suggesting some implications for the wider community.

