Data Science Tools for Monitoring the Global Repository Eco-System and its Lines of Evolution

Authors : Friedrich Summann, Andreas Czerniak, Jochen Schirrwagen, Dirk Pieper

The global network of scholarly repositories for the publication and dissemination of scientific publications and related materials can already look back on a history of more than twenty years.

During this period, there have been many developments, both in technical optimization and in the growth of content. It is crucial to observe and analyze this evolution in order to draw conclusions for the further development of repositories.

The basis for such an analysis is data. The Open Archives Initiative (OAI) service provider Bielefeld Academic Search Engine (BASE) started indexing repositories in 2004 and has also collected metadata about the repositories themselves.

This paper presents the main features of a planned repository monitoring system. Data have been collected since 2004 and include basic repository metadata as well as the publication metadata of each repository.

This information allows an in-depth analysis of many indicators in different logical combinations. This paper outlines the system's approach and the integration of data science techniques. It describes the intended monitoring system and shows the first results.

DOI : https://doi.org/10.3390/publications8020035