Analyzing University of Virginia Health publications using open data, Python, and Streamlit

Bibliographic Details
Main Authors: Anson Parker, Abbey Heflin, Lucy Carr Jones
Format: article
Language: EN
Published: University Library System, University of Pittsburgh 2021
Subjects:
Z
R
Online Access: https://doaj.org/article/76db779472e14d4d9e44b0edd8e25a25
Description
Summary: As part of a larger project to understand the publishing choices of UVA Health authors and support open access publishing, a team from the Claude Moore Health Sciences Library analyzed an open data set from Europe PMC, which includes metadata from PubMed records. We used the Europe PMC REST API to search for articles published in 2017–2020 with “University of Virginia” in the author affiliation field. We then parsed the JSON metadata in Python and used Streamlit to create a data visualization from our public GitHub repository. At present, the visualization shows the relative proportions of open access versus subscription-only articles published by UVA Health authors. Although subscription services such as Web of Science, Scopus, and Dimensions allow users to do similar analyses, we believe this is a novel approach to this type of bibliometric research, one that relies on open data and open source tools.
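
The workflow described in the summary can be illustrated with a minimal Python sketch. This is not the authors' code: the function names (build_query, fetch_page, count_open_access), the page size, and the category labels are hypothetical, and the query string assumes Europe PMC's AFF and PUB_YEAR search fields. The sketch also assumes each result record carries an isOpenAccess flag set to "Y" or "N", and it treats any record without a "Y" flag as not open access.

```python
import json
import urllib.parse
import urllib.request
from collections import Counter

# Europe PMC REST search endpoint.
SEARCH_URL = "https://www.ebi.ac.uk/europepmc/webservices/rest/search"


def build_query(affiliation="University of Virginia", first_year=2017, last_year=2020):
    """Build a search string (hypothetical helper; field names are assumptions)."""
    return f'AFF:"{affiliation}" AND PUB_YEAR:[{first_year} TO {last_year}]'


def fetch_page(query, cursor_mark="*", page_size=100):
    """Request one page of JSON results; performs a network call when invoked."""
    params = urllib.parse.urlencode({
        "query": query,
        "format": "json",
        "pageSize": page_size,
        "cursorMark": cursor_mark,
    })
    with urllib.request.urlopen(f"{SEARCH_URL}?{params}") as response:
        return json.load(response)


def count_open_access(records):
    """Tally records by their isOpenAccess flag (assumed to be "Y" or "N")."""
    counts = Counter()
    for record in records:
        # Records with a missing flag are counted as not open access (an assumption).
        label = "open access" if record.get("isOpenAccess") == "Y" else "not open access"
        counts[label] += 1
    return counts


if __name__ == "__main__":
    page = fetch_page(build_query())
    records = page.get("resultList", {}).get("result", [])
    print(count_open_access(records))
```

Only the first page of results is fetched here; a fuller version would follow the cursor value returned with each page until no new results arrive. In a Streamlit app, the resulting counts could be passed to a chart call such as st.bar_chart to display the relative proportions.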