The whole is greater than its parts: ensembling improves protein contact prediction
Abstract The prediction of amino acid contacts from protein sequence is an important problem, as protein contacts are a vital step towards the prediction of folded protein structures. We propose that a powerful concept from deep learning, called ensembling, can increase the accuracy of protein conta...
Guardado en:
Autores principales: | , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/d52d01b461ea4fc19ffcfa13bd818ae8 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:d52d01b461ea4fc19ffcfa13bd818ae8 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:d52d01b461ea4fc19ffcfa13bd818ae82021-12-02T15:51:13ZThe whole is greater than its parts: ensembling improves protein contact prediction10.1038/s41598-021-87524-02045-2322https://doaj.org/article/d52d01b461ea4fc19ffcfa13bd818ae82021-04-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-87524-0https://doaj.org/toc/2045-2322Abstract The prediction of amino acid contacts from protein sequence is an important problem, as protein contacts are a vital step towards the prediction of folded protein structures. We propose that a powerful concept from deep learning, called ensembling, can increase the accuracy of protein contact predictions by combining the outputs of different neural network models. We show that ensembling the predictions made by different groups at the recent Critical Assessment of Protein Structure Prediction (CASP13) outperforms all individual groups. Further, we show that contacts derived from the distance predictions of three additional deep neural networks—AlphaFold, trRosetta, and ProSPr—can be substantially improved by ensembling all three networks. We also show that ensembling these recent deep neural networks with the best CASP13 group creates a superior contact prediction tool. Finally, we demonstrate that two ensembled networks can successfully differentiate between the folds of two highly homologous sequences. In order to build further on these findings, we propose the creation of a better protein contact benchmark set and additional open-source contact prediction methods.Wendy M. BillingsConnor J. MorrisDennis Della CorteNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-7 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Wendy M. Billings Connor J. Morris Dennis Della Corte The whole is greater than its parts: ensembling improves protein contact prediction |
description |
Abstract The prediction of amino acid contacts from protein sequence is an important problem, as protein contacts are a vital step towards the prediction of folded protein structures. We propose that a powerful concept from deep learning, called ensembling, can increase the accuracy of protein contact predictions by combining the outputs of different neural network models. We show that ensembling the predictions made by different groups at the recent Critical Assessment of Protein Structure Prediction (CASP13) outperforms all individual groups. Further, we show that contacts derived from the distance predictions of three additional deep neural networks—AlphaFold, trRosetta, and ProSPr—can be substantially improved by ensembling all three networks. We also show that ensembling these recent deep neural networks with the best CASP13 group creates a superior contact prediction tool. Finally, we demonstrate that two ensembled networks can successfully differentiate between the folds of two highly homologous sequences. In order to build further on these findings, we propose the creation of a better protein contact benchmark set and additional open-source contact prediction methods. |
format |
article |
author |
Wendy M. Billings Connor J. Morris Dennis Della Corte |
author_facet |
Wendy M. Billings Connor J. Morris Dennis Della Corte |
author_sort |
Wendy M. Billings |
title |
The whole is greater than its parts: ensembling improves protein contact prediction |
title_short |
The whole is greater than its parts: ensembling improves protein contact prediction |
title_full |
The whole is greater than its parts: ensembling improves protein contact prediction |
title_fullStr |
The whole is greater than its parts: ensembling improves protein contact prediction |
title_full_unstemmed |
The whole is greater than its parts: ensembling improves protein contact prediction |
title_sort |
whole is greater than its parts: ensembling improves protein contact prediction |
publisher |
Nature Portfolio |
publishDate |
2021 |
url |
https://doaj.org/article/d52d01b461ea4fc19ffcfa13bd818ae8 |
work_keys_str_mv |
AT wendymbillings thewholeisgreaterthanitspartsensemblingimprovesproteincontactprediction AT connorjmorris thewholeisgreaterthanitspartsensemblingimprovesproteincontactprediction AT dennisdellacorte thewholeisgreaterthanitspartsensemblingimprovesproteincontactprediction AT wendymbillings wholeisgreaterthanitspartsensemblingimprovesproteincontactprediction AT connorjmorris wholeisgreaterthanitspartsensemblingimprovesproteincontactprediction AT dennisdellacorte wholeisgreaterthanitspartsensemblingimprovesproteincontactprediction |
_version_ |
1718385660884156416 |