Characterisation of Temporal Patterns in Step Count Behaviour from Smartphone App Data: An Unsupervised Machine Learning Approach

The increasing ubiquity of smartphone data, with greater spatial and temporal coverage than achieved by traditional study designs, have the potential to provide insight into habitual physical activity patterns. This study implements and evaluates the utility of both K-means clustering and agglomerat...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Francesca Pontin, Nik Lomax, Graham Clarke, Michelle A. Morris
Formato: article
Lenguaje:EN
Publicado: MDPI AG 2021
Materias:
R
Acceso en línea:https://doaj.org/article/8c123dc6713846f096840954df911a49
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
Descripción
Sumario:The increasing ubiquity of smartphone data, with greater spatial and temporal coverage than achieved by traditional study designs, have the potential to provide insight into habitual physical activity patterns. This study implements and evaluates the utility of both K-means clustering and agglomerative hierarchical clustering methods in identifying weekly and yearlong physical activity behaviour trends. Characterising the demographics and choice of activity type within the identified clusters of behaviour. Across all seven clusters of seasonal activity behaviour identified, daylight saving was shown to play a key role in influencing behaviour, with increased activity in summer months. Investigation into weekly behaviours identified six clusters with varied roles, of weekday versus weekend, on the likelihood of meeting physical activity guidelines. Preferred type of physical activity likewise varied between clusters, with gender and age strongly associated with cluster membership. Key relationships are identified between weekly clusters and seasonal activity behaviour clusters, demonstrating how short-term behaviours contribute to longer-term activity patterns. Utilising unsupervised machine learning, this study demonstrates how the volume and richness of secondary app data can allow us to move away from aggregate measures of physical activity to better understand temporal variations in habitual physical activity behaviour.