Master Blaster: an approach to sensitive identification of remotely related proteins

Abstract Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our abil...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Chintalapati Janaki, Venkatraman S. Gowri, Narayanaswamy Srinivasan
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/a141573a94c1465987955fa87d4a529f
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:a141573a94c1465987955fa87d4a529f
record_format dspace
spelling oai:doaj.org-article:a141573a94c1465987955fa87d4a529f2021-12-02T18:27:49ZMaster Blaster: an approach to sensitive identification of remotely related proteins10.1038/s41598-021-87833-42045-2322https://doaj.org/article/a141573a94c1465987955fa87d4a529f2021-04-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-87833-4https://doaj.org/toc/2045-2322Abstract Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to detect distant relationships. In the past, an approach involving multiple Position Specific Scoring Matrices (PSSMs) was found to be more effective than traditional single PSSMs. Cascaded search is another successful approach where hits of a search are queried to detect more homologues. We propose a protocol, ‘Master Blaster’, which combines the principles adopted in these two approaches to enhance our ability to detect remote homologues even further. Assessment of the approach was performed using known relationships available in the SCOP70 database, and the results were compared against that of PSI-BLAST and HHblits, a hidden Markov model-based method. Compared to PSI-BLAST, Master Blaster resulted in 10% improvement with respect to detection of cross superfamily connections, nearly 35% improvement in cross family and more than 80% improvement in intra family connections. From the results it was observed that HHblits is more sensitive in detecting remote homologues compared to Master Blaster. However, there are true hits from 46-folds for which Master Blaster reported homologs that are not reported by HHblits even using the optimal parameters indicating that for detecting remote homologues, use of multiple methods employing a combination of different approaches can be more effective in detecting remote homologs. Master Blaster stand-alone code is available for download in the supplementary archive.Chintalapati JanakiVenkatraman S. GowriNarayanaswamy SrinivasanNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-14 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Chintalapati Janaki
Venkatraman S. Gowri
Narayanaswamy Srinivasan
Master Blaster: an approach to sensitive identification of remotely related proteins
description Abstract Genome sequencing projects unearth sequences of all the protein sequences encoded in a genome. As the first step, homology detection is employed to obtain clues to structure and function of these proteins. However, high evolutionary divergence between homologous proteins challenges our ability to detect distant relationships. In the past, an approach involving multiple Position Specific Scoring Matrices (PSSMs) was found to be more effective than traditional single PSSMs. Cascaded search is another successful approach where hits of a search are queried to detect more homologues. We propose a protocol, ‘Master Blaster’, which combines the principles adopted in these two approaches to enhance our ability to detect remote homologues even further. Assessment of the approach was performed using known relationships available in the SCOP70 database, and the results were compared against that of PSI-BLAST and HHblits, a hidden Markov model-based method. Compared to PSI-BLAST, Master Blaster resulted in 10% improvement with respect to detection of cross superfamily connections, nearly 35% improvement in cross family and more than 80% improvement in intra family connections. From the results it was observed that HHblits is more sensitive in detecting remote homologues compared to Master Blaster. However, there are true hits from 46-folds for which Master Blaster reported homologs that are not reported by HHblits even using the optimal parameters indicating that for detecting remote homologues, use of multiple methods employing a combination of different approaches can be more effective in detecting remote homologs. Master Blaster stand-alone code is available for download in the supplementary archive.
format article
author Chintalapati Janaki
Venkatraman S. Gowri
Narayanaswamy Srinivasan
author_facet Chintalapati Janaki
Venkatraman S. Gowri
Narayanaswamy Srinivasan
author_sort Chintalapati Janaki
title Master Blaster: an approach to sensitive identification of remotely related proteins
title_short Master Blaster: an approach to sensitive identification of remotely related proteins
title_full Master Blaster: an approach to sensitive identification of remotely related proteins
title_fullStr Master Blaster: an approach to sensitive identification of remotely related proteins
title_full_unstemmed Master Blaster: an approach to sensitive identification of remotely related proteins
title_sort master blaster: an approach to sensitive identification of remotely related proteins
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/a141573a94c1465987955fa87d4a529f
work_keys_str_mv AT chintalapatijanaki masterblasteranapproachtosensitiveidentificationofremotelyrelatedproteins
AT venkatramansgowri masterblasteranapproachtosensitiveidentificationofremotelyrelatedproteins
AT narayanaswamysrinivasan masterblasteranapproachtosensitiveidentificationofremotelyrelatedproteins
_version_ 1718377997401063424