Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.

Due to difficulty in early diagnosis of Alzheimer's disease (AD) related to cost and differentiated capability, it is necessary to identify low-cost, accessible, and reliable tools for identifying AD risk in the preclinical stage. We hypothesized that cognitive ability, as expressed in the voca...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Akihiro Shimoda, Yue Li, Hana Hayashi, Naoki Kondo
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/d528325e45cf4ee19a37e1b957f38f4a
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:d528325e45cf4ee19a37e1b957f38f4a
record_format dspace
spelling oai:doaj.org-article:d528325e45cf4ee19a37e1b957f38f4a2021-12-02T20:09:14ZDementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.1932-620310.1371/journal.pone.0253988https://doaj.org/article/d528325e45cf4ee19a37e1b957f38f4a2021-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0253988https://doaj.org/toc/1932-6203Due to difficulty in early diagnosis of Alzheimer's disease (AD) related to cost and differentiated capability, it is necessary to identify low-cost, accessible, and reliable tools for identifying AD risk in the preclinical stage. We hypothesized that cognitive ability, as expressed in the vocal features in daily conversation, is associated with AD progression. Thus, we have developed a novel machine learning prediction model to identify AD risk by using the rich voice data collected from daily conversations, and evaluated its predictive performance in comparison with a classification method based on the Japanese version of the Telephone Interview for Cognitive Status (TICS-J). We used 1,465 audio data files from 99 Healthy controls (HC) and 151 audio data files recorded from 24 AD patients derived from a dementia prevention program conducted by Hachioji City, Tokyo, between March and May 2020. After extracting vocal features from each audio file, we developed machine-learning models based on extreme gradient boosting (XGBoost), random forest (RF), and logistic regression (LR), using each audio file as one observation. We evaluated the predictive performance of the developed models by describing the receiver operating characteristic (ROC) curve, calculating the areas under the curve (AUCs), sensitivity, and specificity. Further, we conducted classifications by considering each participant as one observation, computing the average of their audio files' predictive value, and making comparisons with the predictive performance of the TICS-J based questionnaire. Of 1,616 audio files in total, 1,308 (81.0%) were randomly allocated to the training data and 308 (19.1%) to the validation data. For audio file-based prediction, the AUCs for XGboost, RF, and LR were 0.863 (95% confidence interval [CI]: 0.794-0.931), 0.882 (95% CI: 0.840-0.924), and 0.893 (95%CI: 0.832-0.954), respectively. For participant-based prediction, the AUC for XGboost, RF, LR, and TICS-J were 1.000 (95%CI: 1.000-1.000), 1.000 (95%CI: 1.000-1.000), 0.972 (95%CI: 0.918-1.000) and 0.917 (95%CI: 0.918-1.000), respectively. There was difference in predictive accuracy of XGBoost and TICS-J with almost approached significance (p = 0.065). Our novel prediction model using the vocal features of daily conversations demonstrated the potential to be useful for the AD risk assessment.Akihiro ShimodaYue LiHana HayashiNaoki KondoPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 16, Iss 7, p e0253988 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Akihiro Shimoda
Yue Li
Hana Hayashi
Naoki Kondo
Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.
description Due to difficulty in early diagnosis of Alzheimer's disease (AD) related to cost and differentiated capability, it is necessary to identify low-cost, accessible, and reliable tools for identifying AD risk in the preclinical stage. We hypothesized that cognitive ability, as expressed in the vocal features in daily conversation, is associated with AD progression. Thus, we have developed a novel machine learning prediction model to identify AD risk by using the rich voice data collected from daily conversations, and evaluated its predictive performance in comparison with a classification method based on the Japanese version of the Telephone Interview for Cognitive Status (TICS-J). We used 1,465 audio data files from 99 Healthy controls (HC) and 151 audio data files recorded from 24 AD patients derived from a dementia prevention program conducted by Hachioji City, Tokyo, between March and May 2020. After extracting vocal features from each audio file, we developed machine-learning models based on extreme gradient boosting (XGBoost), random forest (RF), and logistic regression (LR), using each audio file as one observation. We evaluated the predictive performance of the developed models by describing the receiver operating characteristic (ROC) curve, calculating the areas under the curve (AUCs), sensitivity, and specificity. Further, we conducted classifications by considering each participant as one observation, computing the average of their audio files' predictive value, and making comparisons with the predictive performance of the TICS-J based questionnaire. Of 1,616 audio files in total, 1,308 (81.0%) were randomly allocated to the training data and 308 (19.1%) to the validation data. For audio file-based prediction, the AUCs for XGboost, RF, and LR were 0.863 (95% confidence interval [CI]: 0.794-0.931), 0.882 (95% CI: 0.840-0.924), and 0.893 (95%CI: 0.832-0.954), respectively. For participant-based prediction, the AUC for XGboost, RF, LR, and TICS-J were 1.000 (95%CI: 1.000-1.000), 1.000 (95%CI: 1.000-1.000), 0.972 (95%CI: 0.918-1.000) and 0.917 (95%CI: 0.918-1.000), respectively. There was difference in predictive accuracy of XGBoost and TICS-J with almost approached significance (p = 0.065). Our novel prediction model using the vocal features of daily conversations demonstrated the potential to be useful for the AD risk assessment.
format article
author Akihiro Shimoda
Yue Li
Hana Hayashi
Naoki Kondo
author_facet Akihiro Shimoda
Yue Li
Hana Hayashi
Naoki Kondo
author_sort Akihiro Shimoda
title Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.
title_short Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.
title_full Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.
title_fullStr Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.
title_full_unstemmed Dementia risks identified by vocal features via telephone conversations: A novel machine learning prediction model.
title_sort dementia risks identified by vocal features via telephone conversations: a novel machine learning prediction model.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/d528325e45cf4ee19a37e1b957f38f4a
work_keys_str_mv AT akihiroshimoda dementiarisksidentifiedbyvocalfeaturesviatelephoneconversationsanovelmachinelearningpredictionmodel
AT yueli dementiarisksidentifiedbyvocalfeaturesviatelephoneconversationsanovelmachinelearningpredictionmodel
AT hanahayashi dementiarisksidentifiedbyvocalfeaturesviatelephoneconversationsanovelmachinelearningpredictionmodel
AT naokikondo dementiarisksidentifiedbyvocalfeaturesviatelephoneconversationsanovelmachinelearningpredictionmodel
_version_ 1718375152434020352