Curated BLAST for Genomes

ABSTRACT Curated BLAST for Genomes finds candidate genes for a process or an enzymatic activity within a genome of interest. In contrast to annotation tools, which usually predict a single activity for each protein, Curated BLAST asks if any of the proteins in the genome are similar to characterized...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Morgan N. Price, Adam P. Arkin
Formato: article
Lenguaje:EN
Publicado: American Society for Microbiology 2019
Materias:
Acceso en línea:https://doaj.org/article/f5b2f47470534ff099b8a3d1b98204e0
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:f5b2f47470534ff099b8a3d1b98204e0
record_format dspace
spelling oai:doaj.org-article:f5b2f47470534ff099b8a3d1b98204e02021-12-02T18:39:15ZCurated BLAST for Genomes10.1128/mSystems.00072-192379-5077https://doaj.org/article/f5b2f47470534ff099b8a3d1b98204e02019-04-01T00:00:00Zhttps://journals.asm.org/doi/10.1128/mSystems.00072-19https://doaj.org/toc/2379-5077ABSTRACT Curated BLAST for Genomes finds candidate genes for a process or an enzymatic activity within a genome of interest. In contrast to annotation tools, which usually predict a single activity for each protein, Curated BLAST asks if any of the proteins in the genome are similar to characterized proteins that are relevant. Given a query such as an enzyme’s name or an EC number, Curated BLAST searches the curated descriptions of over 100,000 characterized proteins, and it compares the relevant characterized proteins to the predicted proteins in the genome of interest. In case of errors in the gene models, Curated BLAST also searches the six-frame translation of the genome. Curated BLAST is available at http://papers.genomics.lbl.gov/curated. IMPORTANCE Given a microbe’s genome sequence, we often want to predict what capabilities the organism has, such as which nutrients it requires or which energy sources it can use. Or, we know the organism has a capability and we want to find the genes involved. Scientists often use automated gene annotations to find relevant genes, but automated annotations are often vague or incorrect. Curated BLAST finds candidate genes for a capability without relying on automated annotations. First, Curated BLAST finds proteins (usually from other organisms) whose functions have been studied experimentally and whose curated descriptions match a query. Then, it searches the genome of interest for similar proteins and returns a list of candidates. Curated BLAST is fast and often finds relevant genes that are missed by automated annotation.Morgan N. PriceAdam P. ArkinAmerican Society for MicrobiologyarticleannotationMicrobiologyQR1-502ENmSystems, Vol 4, Iss 2 (2019)
institution DOAJ
collection DOAJ
language EN
topic annotation
Microbiology
QR1-502
spellingShingle annotation
Microbiology
QR1-502
Morgan N. Price
Adam P. Arkin
Curated BLAST for Genomes
description ABSTRACT Curated BLAST for Genomes finds candidate genes for a process or an enzymatic activity within a genome of interest. In contrast to annotation tools, which usually predict a single activity for each protein, Curated BLAST asks if any of the proteins in the genome are similar to characterized proteins that are relevant. Given a query such as an enzyme’s name or an EC number, Curated BLAST searches the curated descriptions of over 100,000 characterized proteins, and it compares the relevant characterized proteins to the predicted proteins in the genome of interest. In case of errors in the gene models, Curated BLAST also searches the six-frame translation of the genome. Curated BLAST is available at http://papers.genomics.lbl.gov/curated. IMPORTANCE Given a microbe’s genome sequence, we often want to predict what capabilities the organism has, such as which nutrients it requires or which energy sources it can use. Or, we know the organism has a capability and we want to find the genes involved. Scientists often use automated gene annotations to find relevant genes, but automated annotations are often vague or incorrect. Curated BLAST finds candidate genes for a capability without relying on automated annotations. First, Curated BLAST finds proteins (usually from other organisms) whose functions have been studied experimentally and whose curated descriptions match a query. Then, it searches the genome of interest for similar proteins and returns a list of candidates. Curated BLAST is fast and often finds relevant genes that are missed by automated annotation.
format article
author Morgan N. Price
Adam P. Arkin
author_facet Morgan N. Price
Adam P. Arkin
author_sort Morgan N. Price
title Curated BLAST for Genomes
title_short Curated BLAST for Genomes
title_full Curated BLAST for Genomes
title_fullStr Curated BLAST for Genomes
title_full_unstemmed Curated BLAST for Genomes
title_sort curated blast for genomes
publisher American Society for Microbiology
publishDate 2019
url https://doaj.org/article/f5b2f47470534ff099b8a3d1b98204e0
work_keys_str_mv AT morgannprice curatedblastforgenomes
AT adamparkin curatedblastforgenomes
_version_ 1718377776470294528