Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes

Hormone binding protein (HBP) is a soluble carrier protein that interacts selectively with different types of hormones and has various effects on the body’s life activities. HBPs play an important role in the growth process of organisms, but their specific role is still unclear. Therefore, correctly...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Yuxin Guo, Liping Hou, Wen Zhu, Peng Wang
Formato: article
Lenguaje:EN
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://doaj.org/article/624657e15b19456789f0edd15156e197
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:624657e15b19456789f0edd15156e197
record_format dspace
spelling oai:doaj.org-article:624657e15b19456789f0edd15156e1972021-11-30T13:27:59ZPrediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes1664-802110.3389/fgene.2021.797641https://doaj.org/article/624657e15b19456789f0edd15156e1972021-11-01T00:00:00Zhttps://www.frontiersin.org/articles/10.3389/fgene.2021.797641/fullhttps://doaj.org/toc/1664-8021Hormone binding protein (HBP) is a soluble carrier protein that interacts selectively with different types of hormones and has various effects on the body’s life activities. HBPs play an important role in the growth process of organisms, but their specific role is still unclear. Therefore, correctly identifying HBPs is the first step towards understanding and studying their biological function. However, due to their high cost and long experimental period, it is difficult for traditional biochemical experiments to correctly identify HBPs from an increasing number of proteins, so the real characterization of HBPs has become a challenging task for researchers. To measure the effectiveness of HBPs, an accurate and reliable prediction model for their identification is desirable. In this paper, we construct the prediction model HBP_NB. First, HBPs data were collected from the UniProt database, and a dataset was established. Then, based on the established high-quality dataset, the k-mer (K = 3) feature representation method was used to extract features. Second, the feature selection algorithm was used to reduce the dimensionality of the extracted features and select the appropriate optimal feature set. Finally, the selected features are input into Naive Bayes to construct the prediction model, and the model is evaluated by using 10-fold cross-validation. The final results were 95.45% accuracy, 94.17% sensitivity and 96.73% specificity. These results indicate that our model is feasible and effective.Yuxin GuoYuxin GuoYuxin GuoYuxin GuoLiping HouWen ZhuWen ZhuWen ZhuPeng WangPeng WangPeng WangFrontiers Media S.A.articlehormone binding proteinfeature selectionprotein classificationk-mernaive Bayes modelGeneticsQH426-470ENFrontiers in Genetics, Vol 12 (2021)
institution DOAJ
collection DOAJ
language EN
topic hormone binding protein
feature selection
protein classification
k-mer
naive Bayes model
Genetics
QH426-470
spellingShingle hormone binding protein
feature selection
protein classification
k-mer
naive Bayes model
Genetics
QH426-470
Yuxin Guo
Yuxin Guo
Yuxin Guo
Yuxin Guo
Liping Hou
Wen Zhu
Wen Zhu
Wen Zhu
Peng Wang
Peng Wang
Peng Wang
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
description Hormone binding protein (HBP) is a soluble carrier protein that interacts selectively with different types of hormones and has various effects on the body’s life activities. HBPs play an important role in the growth process of organisms, but their specific role is still unclear. Therefore, correctly identifying HBPs is the first step towards understanding and studying their biological function. However, due to their high cost and long experimental period, it is difficult for traditional biochemical experiments to correctly identify HBPs from an increasing number of proteins, so the real characterization of HBPs has become a challenging task for researchers. To measure the effectiveness of HBPs, an accurate and reliable prediction model for their identification is desirable. In this paper, we construct the prediction model HBP_NB. First, HBPs data were collected from the UniProt database, and a dataset was established. Then, based on the established high-quality dataset, the k-mer (K = 3) feature representation method was used to extract features. Second, the feature selection algorithm was used to reduce the dimensionality of the extracted features and select the appropriate optimal feature set. Finally, the selected features are input into Naive Bayes to construct the prediction model, and the model is evaluated by using 10-fold cross-validation. The final results were 95.45% accuracy, 94.17% sensitivity and 96.73% specificity. These results indicate that our model is feasible and effective.
format article
author Yuxin Guo
Yuxin Guo
Yuxin Guo
Yuxin Guo
Liping Hou
Wen Zhu
Wen Zhu
Wen Zhu
Peng Wang
Peng Wang
Peng Wang
author_facet Yuxin Guo
Yuxin Guo
Yuxin Guo
Yuxin Guo
Liping Hou
Wen Zhu
Wen Zhu
Wen Zhu
Peng Wang
Peng Wang
Peng Wang
author_sort Yuxin Guo
title Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
title_short Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
title_full Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
title_fullStr Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
title_full_unstemmed Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
title_sort prediction of hormone-binding proteins based on k-mer feature representation and naive bayes
publisher Frontiers Media S.A.
publishDate 2021
url https://doaj.org/article/624657e15b19456789f0edd15156e197
work_keys_str_mv AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT lipinghou predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT wenzhu predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT wenzhu predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT wenzhu predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT pengwang predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT pengwang predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
AT pengwang predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes
_version_ 1718406565852086272