KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest

DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-relat...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Yuran Jia, Shan Huang, Tianjiao Zhang
Formato: article
Lenguaje:EN
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://doaj.org/article/21685494e2854c9cae05cfe5b726eb66
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:21685494e2854c9cae05cfe5b726eb66
record_format dspace
spelling oai:doaj.org-article:21685494e2854c9cae05cfe5b726eb662021-12-01T14:21:44ZKK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest1664-802110.3389/fgene.2021.811158https://doaj.org/article/21685494e2854c9cae05cfe5b726eb662021-11-01T00:00:00Zhttps://www.frontiersin.org/articles/10.3389/fgene.2021.811158/fullhttps://doaj.org/toc/1664-8021DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-related features, resulting in rough prediction results. In this article, we develop a DNA-binding protein identification method called KK-DBP. To improve prediction accuracy, we propose a feature extraction method that fuses multiple PSSM features. The experimental results show a prediction accuracy on the independent test dataset PDB186 of 81.22%, which is the highest of all existing methods.Yuran JiaShan HuangTianjiao ZhangFrontiers Media S.A.articleDNA-binding proteinposition specificity score matrixrandom forestfeature extractionmulti-feature fusionGeneticsQH426-470ENFrontiers in Genetics, Vol 12 (2021)
institution DOAJ
collection DOAJ
language EN
topic DNA-binding protein
position specificity score matrix
random forest
feature extraction
multi-feature fusion
Genetics
QH426-470
spellingShingle DNA-binding protein
position specificity score matrix
random forest
feature extraction
multi-feature fusion
Genetics
QH426-470
Yuran Jia
Shan Huang
Tianjiao Zhang
KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
description DNA-binding protein (DBP) is a protein with a special DNA binding domain that is associated with many important molecular biological mechanisms. Rapid development of computational methods has made it possible to predict DBP on a large scale; however, existing methods do not fully integrate DBP-related features, resulting in rough prediction results. In this article, we develop a DNA-binding protein identification method called KK-DBP. To improve prediction accuracy, we propose a feature extraction method that fuses multiple PSSM features. The experimental results show a prediction accuracy on the independent test dataset PDB186 of 81.22%, which is the highest of all existing methods.
format article
author Yuran Jia
Shan Huang
Tianjiao Zhang
author_facet Yuran Jia
Shan Huang
Tianjiao Zhang
author_sort Yuran Jia
title KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
title_short KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
title_full KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
title_fullStr KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
title_full_unstemmed KK-DBP: A Multi-Feature Fusion Method for DNA-Binding Protein Identification Based on Random Forest
title_sort kk-dbp: a multi-feature fusion method for dna-binding protein identification based on random forest
publisher Frontiers Media S.A.
publishDate 2021
url https://doaj.org/article/21685494e2854c9cae05cfe5b726eb66
work_keys_str_mv AT yuranjia kkdbpamultifeaturefusionmethodfordnabindingproteinidentificationbasedonrandomforest
AT shanhuang kkdbpamultifeaturefusionmethodfordnabindingproteinidentificationbasedonrandomforest
AT tianjiaozhang kkdbpamultifeaturefusionmethodfordnabindingproteinidentificationbasedonrandomforest
_version_ 1718405044632551424