Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.

The topology of protein folds can be specified by the inter-residue contact-maps and accurate contact-map prediction can help ab initio structure folding. We developed TripletRes to deduce protein contact-maps from discretized distance profiles by end-to-end training of deep residual neural-networks...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Yang Li, Chengxin Zhang, Eric W Bell, Wei Zheng, Xiaogen Zhou, Dong-Jun Yu, Yang Zhang
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
Acceso en línea:https://doaj.org/article/ebd691bba94f4b35990eb6e0cb86a802
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:ebd691bba94f4b35990eb6e0cb86a802
record_format dspace
spelling oai:doaj.org-article:ebd691bba94f4b35990eb6e0cb86a8022021-12-02T19:58:14ZDeducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.1553-734X1553-735810.1371/journal.pcbi.1008865https://doaj.org/article/ebd691bba94f4b35990eb6e0cb86a8022021-03-01T00:00:00Zhttps://doi.org/10.1371/journal.pcbi.1008865https://doaj.org/toc/1553-734Xhttps://doaj.org/toc/1553-7358The topology of protein folds can be specified by the inter-residue contact-maps and accurate contact-map prediction can help ab initio structure folding. We developed TripletRes to deduce protein contact-maps from discretized distance profiles by end-to-end training of deep residual neural-networks. Compared to previous approaches, the major advantage of TripletRes is in its ability to learn and directly fuse a triplet of coevolutionary matrices extracted from the whole-genome and metagenome databases and therefore minimize the information loss during the course of contact model training. TripletRes was tested on a large set of 245 non-homologous proteins from CASP 11&12 and CAMEO experiments and outperformed other top methods from CASP12 by at least 58.4% for the CASP 11&12 targets and 44.4% for the CAMEO targets in the top-L long-range contact precision. On the 31 FM targets from the latest CASP13 challenge, TripletRes achieved the highest precision (71.6%) for the top-L/5 long-range contact predictions. It was also shown that a simple re-training of the TripletRes model with more proteins can lead to further improvement with precisions comparable to state-of-the-art methods developed after CASP13. These results demonstrate a novel efficient approach to extend the power of deep convolutional networks for high-accuracy medium- and long-range protein contact-map predictions starting from primary sequences, which are critical for constructing 3D structure of proteins that lack homologous templates in the PDB library.Yang LiChengxin ZhangEric W BellWei ZhengXiaogen ZhouDong-Jun YuYang ZhangPublic Library of Science (PLoS)articleBiology (General)QH301-705.5ENPLoS Computational Biology, Vol 17, Iss 3, p e1008865 (2021)
institution DOAJ
collection DOAJ
language EN
topic Biology (General)
QH301-705.5
spellingShingle Biology (General)
QH301-705.5
Yang Li
Chengxin Zhang
Eric W Bell
Wei Zheng
Xiaogen Zhou
Dong-Jun Yu
Yang Zhang
Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
description The topology of protein folds can be specified by the inter-residue contact-maps and accurate contact-map prediction can help ab initio structure folding. We developed TripletRes to deduce protein contact-maps from discretized distance profiles by end-to-end training of deep residual neural-networks. Compared to previous approaches, the major advantage of TripletRes is in its ability to learn and directly fuse a triplet of coevolutionary matrices extracted from the whole-genome and metagenome databases and therefore minimize the information loss during the course of contact model training. TripletRes was tested on a large set of 245 non-homologous proteins from CASP 11&12 and CAMEO experiments and outperformed other top methods from CASP12 by at least 58.4% for the CASP 11&12 targets and 44.4% for the CAMEO targets in the top-L long-range contact precision. On the 31 FM targets from the latest CASP13 challenge, TripletRes achieved the highest precision (71.6%) for the top-L/5 long-range contact predictions. It was also shown that a simple re-training of the TripletRes model with more proteins can lead to further improvement with precisions comparable to state-of-the-art methods developed after CASP13. These results demonstrate a novel efficient approach to extend the power of deep convolutional networks for high-accuracy medium- and long-range protein contact-map predictions starting from primary sequences, which are critical for constructing 3D structure of proteins that lack homologous templates in the PDB library.
format article
author Yang Li
Chengxin Zhang
Eric W Bell
Wei Zheng
Xiaogen Zhou
Dong-Jun Yu
Yang Zhang
author_facet Yang Li
Chengxin Zhang
Eric W Bell
Wei Zheng
Xiaogen Zhou
Dong-Jun Yu
Yang Zhang
author_sort Yang Li
title Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
title_short Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
title_full Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
title_fullStr Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
title_full_unstemmed Deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
title_sort deducing high-accuracy protein contact-maps from a triplet of coevolutionary matrices through deep residual convolutional networks.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/ebd691bba94f4b35990eb6e0cb86a802
work_keys_str_mv AT yangli deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
AT chengxinzhang deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
AT ericwbell deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
AT weizheng deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
AT xiaogenzhou deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
AT dongjunyu deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
AT yangzhang deducinghighaccuracyproteincontactmapsfromatripletofcoevolutionarymatricesthroughdeepresidualconvolutionalnetworks
_version_ 1718375813719523328