Authors : Sanghee Oh, Yunseo Park, Seyun Sim
Data Availability Statements (DAS) have become a standard mechanism for promoting transparency and reproducibility in open-access mega journals, yet questions remain about how effectively they support meaningful data sharing in practice and how these practices vary across national research contexts.
This study examines data-sharing practices in the medical and health sciences through a comparative analysis of global publications and Korea-affiliated articles in three mega journals indexed in PubMed Central: PLOS ONE, Scientific Reports and BMJ Open (2020–2024). DAS from 176,145 articles were collected from PubMed Central using an automated pipeline with manual validation and classified into a seven-category typology reflecting levels of data accessibility and reuse.
Results indicate that although DAS inclusion increased over time, repository-based data sharing remains limited, while ‘data available upon request’ continues to be prevalent. Clear differences are observed across journals: PLOS ONE shows greater use of repository-based and in-article sharing, whereas Scientific Reports and BMJ Open rely more heavily on ‘data available upon request’. Korea-affiliated articles largely follow global trends, with slightly greater reliance on national public data repositories.
Repository use is concentrated among a small number of international multidisciplinary platforms and selected national biomedical databases. The findings reveal a persistent gap between formal DAS compliance and effective data accessibility, indicating the need for clearer, more actionable data-sharing guidance.