KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-relat...
Guardado en:
Autores principales: | , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Frontiers Media S.A.
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/21685494e2854c9cae05cfe5b726eb66 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:21685494e2854c9cae05cfe5b726eb66 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:21685494e2854c9cae05cfe5b726eb662021-12-01T14:21:44ZKK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest1664-802110.3389/fgene.2021.811158https://doaj.org/article/21685494e2854c9cae05cfe5b726eb662021-11-01T00:00:00Zhttps://www.frontiersin.org/articles/10.3389/fgene.2021.811158/fullhttps://doaj.org/toc/1664-8021DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-related features, resulting in rough prediction results. In this article, we develop a DNA-binding protein identification method called KK-DBP. To improve prediction accuracy, we propose a feature extraction method that fuses multiple PSSM features. The experimental results show a prediction accuracy on the independent test dataset PDB186 of 81.22%, which is the highest of all existing methods.Yuran JiaShan HuangTianjiao ZhangFrontiers Media S.A.articleDNA-binding proteinposition specificity score matrixrandom forestfeature extractionmulti-feature fusionGeneticsQH426-470ENFrontiers in Genetics, Vol 12 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
DNA-binding protein position specificity score matrix random forest feature extraction multi-feature fusion Genetics QH426-470 |
spellingShingle |
DNA-binding protein position specificity score matrix random forest feature extraction multi-feature fusion Genetics QH426-470 Yuran Jia Shan Huang Tianjiao Zhang KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest |
description |
DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-related features, resulting in rough prediction results. In this article, we develop a DNA-binding protein identification method called KK-DBP. To improve prediction accuracy, we propose a feature extraction method that fuses multiple PSSM features. The experimental results show a prediction accuracy on the independent test dataset PDB186 of 81.22%, which is the highest of all existing methods. |
format |
article |
author |
Yuran Jia Shan Huang Tianjiao Zhang |
author_facet |
Yuran Jia Shan Huang Tianjiao Zhang |
author_sort |
Yuran Jia |
title |
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest |
title_short |
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest |
title_full |
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest |
title_fullStr |
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest |
title_full_unstemmed |
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest |
title_sort |
kk-dbp: a multi-feature fusion method for dna-binding protein identification based on random forest |
publisher |
Frontiers Media S.A. |
publishDate |
2021 |
url |
https://doaj.org/article/21685494e2854c9cae05cfe5b726eb66 |
work_keys_str_mv |
AT yuranjia kkdbpamultifeaturefusionmethodfordnabindingproteinidentificationbasedonrandomforest AT shanhuang kkdbpamultifeaturefusionmethodfordnabindingproteinidentificationbasedonrandomforest AT tianjiaozhang kkdbpamultifeaturefusionmethodfordnabindingproteinidentificationbasedonrandomforest |
_version_ |
1718405044632551424 |