Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes
Hormone binding protein (HBP) is a soluble carrier protein that interacts selectively with different types of hormones and has various effects on the body’s life activities. HBPs play an important role in the growth process of organisms, but their specific role is still unclear. Therefore, correctly...
Guardado en:
Autores principales: | , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/624657e15b19456789f0edd15156e197 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:624657e15b19456789f0edd15156e197 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:624657e15b19456789f0edd15156e1972021-11-30T13:27:59ZPrediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes1664-802110.3389/fgene.2021.797641https://doaj.org/article/624657e15b19456789f0edd15156e1972021-11-01T00:00:00Zhttps://www.frontiersin.org/articles/10.3389/fgene.2021.797641/fullhttps://doaj.org/toc/1664-8021Hormone binding protein (HBP) is a soluble carrier protein that interacts selectively with different types of hormones and has various effects on the body’s life activities. HBPs play an important role in the growth process of organisms, but their specific role is still unclear. Therefore, correctly identifying HBPs is the first step towards understanding and studying their biological function. However, due to their high cost and long experimental period, it is difficult for traditional biochemical experiments to correctly identify HBPs from an increasing number of proteins, so the real characterization of HBPs has become a challenging task for researchers. To measure the effectiveness of HBPs, an accurate and reliable prediction model for their identification is desirable. In this paper, we construct the prediction model HBP_NB. First, HBPs data were collected from the UniProt database, and a dataset was established. Then, based on the established high-quality dataset, the k-mer (K = 3) feature representation method was used to extract features. Second, the feature selection algorithm was used to reduce the dimensionality of the extracted features and select the appropriate optimal feature set. Finally, the selected features are input into Naive Bayes to construct the prediction model, and the model is evaluated by using 10-fold cross-validation. The final results were 95.45% accuracy, 94.17% sensitivity and 96.73% specificity. These results indicate that our model is feasible and effective.Yuxin GuoYuxin GuoYuxin GuoYuxin GuoLiping HouWen ZhuWen ZhuWen ZhuPeng WangPeng WangPeng WangFrontiers Media S.A.articlehormone binding proteinfeature selectionprotein classificationk-mernaive Bayes modelGeneticsQH426-470ENFrontiers in Genetics, Vol 12 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
hormone binding protein feature selection protein classification k-mer naive Bayes model Genetics QH426-470 |
spellingShingle |
hormone binding protein feature selection protein classification k-mer naive Bayes model Genetics QH426-470 Yuxin Guo Yuxin Guo Yuxin Guo Yuxin Guo Liping Hou Wen Zhu Wen Zhu Wen Zhu Peng Wang Peng Wang Peng Wang Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes |
description |
Hormone binding protein (HBP) is a soluble carrier protein that interacts selectively with different types of hormones and has various effects on the body’s life activities. HBPs play an important role in the growth process of organisms, but their specific role is still unclear. Therefore, correctly identifying HBPs is the first step towards understanding and studying their biological function. However, due to their high cost and long experimental period, it is difficult for traditional biochemical experiments to correctly identify HBPs from an increasing number of proteins, so the real characterization of HBPs has become a challenging task for researchers. To measure the effectiveness of HBPs, an accurate and reliable prediction model for their identification is desirable. In this paper, we construct the prediction model HBP_NB. First, HBPs data were collected from the UniProt database, and a dataset was established. Then, based on the established high-quality dataset, the k-mer (K = 3) feature representation method was used to extract features. Second, the feature selection algorithm was used to reduce the dimensionality of the extracted features and select the appropriate optimal feature set. Finally, the selected features are input into Naive Bayes to construct the prediction model, and the model is evaluated by using 10-fold cross-validation. The final results were 95.45% accuracy, 94.17% sensitivity and 96.73% specificity. These results indicate that our model is feasible and effective. |
format |
article |
author |
Yuxin Guo Yuxin Guo Yuxin Guo Yuxin Guo Liping Hou Wen Zhu Wen Zhu Wen Zhu Peng Wang Peng Wang Peng Wang |
author_facet |
Yuxin Guo Yuxin Guo Yuxin Guo Yuxin Guo Liping Hou Wen Zhu Wen Zhu Wen Zhu Peng Wang Peng Wang Peng Wang |
author_sort |
Yuxin Guo |
title |
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes |
title_short |
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes |
title_full |
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes |
title_fullStr |
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes |
title_full_unstemmed |
Prediction of Hormone-Binding Proteins Based on K-mer Feature Representation and Naive Bayes |
title_sort |
prediction of hormone-binding proteins based on k-mer feature representation and naive bayes |
publisher |
Frontiers Media S.A. |
publishDate |
2021 |
url |
https://doaj.org/article/624657e15b19456789f0edd15156e197 |
work_keys_str_mv |
AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT yuxinguo predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT lipinghou predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT wenzhu predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT wenzhu predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT wenzhu predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT pengwang predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT pengwang predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes AT pengwang predictionofhormonebindingproteinsbasedonkmerfeaturerepresentationandnaivebayes |
_version_ |
1718406565852086272 |