A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball

Sports big data has been an emerging research area in recent years. The purpose of this study was to ascertain the most frequent research topics, application areas, data sources, and data usage characteristics in the existing literature, in order to understand the development of data-driven baseball...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Jyh-How Huang, Yu-Chia Hsu
Formato: article
Lenguaje:EN
Publicado: SAGE Publishing 2021
Materias:
H
Acceso en línea:https://doaj.org/article/020431f8b54845dd92b119767f9a8ae0
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:020431f8b54845dd92b119767f9a8ae0
record_format dspace
spelling oai:doaj.org-article:020431f8b54845dd92b119767f9a8ae02021-12-02T02:03:44ZA Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball2158-244010.1177/21582440211061566https://doaj.org/article/020431f8b54845dd92b119767f9a8ae02021-11-01T00:00:00Zhttps://doi.org/10.1177/21582440211061566https://doaj.org/toc/2158-2440Sports big data has been an emerging research area in recent years. The purpose of this study was to ascertain the most frequent research topics, application areas, data sources, and data usage characteristics in the existing literature, in order to understand the development of data-driven baseball research and the multidisciplinary participation in the big data era. A scoping review was conducted, focusing on the diversity of using publicly available major league baseball data. Next, the co-occurrence analysis in bibliometrics was used to present a knowledge map of the reviewed literature. Finally, we propose a comprehensive baseball data research domain framework to visualize the ecosystem of publicly available sports data applications mapped to the four application domains in the big data maturity model. After searching and screening process from the Web of Science, Science Direct, and SPORTDiscus database, 48 relevant papers with clearly indicated data sources and data fields used were finally selected and full reviewed for advanced analysis. The most relevant research hotspots for sports data are sequentially economics and finance, sports injury, and sports performance evaluation. Subjects studied ranged from pitchers, position players, catchers, umpires, batters, free agents, and attendees. The most popular data sources are PITCHf/x, the Lahman Baseball Database, and baseball-reference.com. This review can serve as a valuable starting point for researchers to plan research strategies, to discover opportunities for cross-disciplinary research innovations, and to categorize their work in the context of the state of research.Jyh-How HuangYu-Chia HsuSAGE PublishingarticleHistory of scholarship and learning. The humanitiesAZ20-999Social SciencesHENSAGE Open, Vol 11 (2021)
institution DOAJ
collection DOAJ
language EN
topic History of scholarship and learning. The humanities
AZ20-999
Social Sciences
H
spellingShingle History of scholarship and learning. The humanities
AZ20-999
Social Sciences
H
Jyh-How Huang
Yu-Chia Hsu
A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball
description Sports big data has been an emerging research area in recent years. The purpose of this study was to ascertain the most frequent research topics, application areas, data sources, and data usage characteristics in the existing literature, in order to understand the development of data-driven baseball research and the multidisciplinary participation in the big data era. A scoping review was conducted, focusing on the diversity of using publicly available major league baseball data. Next, the co-occurrence analysis in bibliometrics was used to present a knowledge map of the reviewed literature. Finally, we propose a comprehensive baseball data research domain framework to visualize the ecosystem of publicly available sports data applications mapped to the four application domains in the big data maturity model. After searching and screening process from the Web of Science, Science Direct, and SPORTDiscus database, 48 relevant papers with clearly indicated data sources and data fields used were finally selected and full reviewed for advanced analysis. The most relevant research hotspots for sports data are sequentially economics and finance, sports injury, and sports performance evaluation. Subjects studied ranged from pitchers, position players, catchers, umpires, batters, free agents, and attendees. The most popular data sources are PITCHf/x, the Lahman Baseball Database, and baseball-reference.com. This review can serve as a valuable starting point for researchers to plan research strategies, to discover opportunities for cross-disciplinary research innovations, and to categorize their work in the context of the state of research.
format article
author Jyh-How Huang
Yu-Chia Hsu
author_facet Jyh-How Huang
Yu-Chia Hsu
author_sort Jyh-How Huang
title A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball
title_short A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball
title_full A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball
title_fullStr A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball
title_full_unstemmed A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball
title_sort multidisciplinary perspective on publicly available sports data in the era of big data: a scoping review of the literature on major league baseball
publisher SAGE Publishing
publishDate 2021
url https://doaj.org/article/020431f8b54845dd92b119767f9a8ae0
work_keys_str_mv AT jyhhowhuang amultidisciplinaryperspectiveonpubliclyavailablesportsdataintheeraofbigdataascopingreviewoftheliteratureonmajorleaguebaseball
AT yuchiahsu amultidisciplinaryperspectiveonpubliclyavailablesportsdataintheeraofbigdataascopingreviewoftheliteratureonmajorleaguebaseball
AT jyhhowhuang multidisciplinaryperspectiveonpubliclyavailablesportsdataintheeraofbigdataascopingreviewoftheliteratureonmajorleaguebaseball
AT yuchiahsu multidisciplinaryperspectiveonpubliclyavailablesportsdataintheeraofbigdataascopingreviewoftheliteratureonmajorleaguebaseball
_version_ 1718402692092526592