Chapter 9: Analyses using disease ontologies.

Advanced statistical methods used to analyze high-throughput data such as gene-expression assays result in long lists of "significant genes." One way to gain insight into the significance of altered expression levels is to determine whether Gene Ontology (GO) terms associated with a partic...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Nigam H Shah, Tyler Cole, Mark A Musen
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2012
Materias:
Acceso en línea:https://doaj.org/article/b15b6283db404acc8279c088cbac6c2f
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:b15b6283db404acc8279c088cbac6c2f
record_format dspace
spelling oai:doaj.org-article:b15b6283db404acc8279c088cbac6c2f2021-11-18T05:52:35ZChapter 9: Analyses using disease ontologies.1553-734X1553-735810.1371/journal.pcbi.1002827https://doaj.org/article/b15b6283db404acc8279c088cbac6c2f2012-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/23300417/pdf/?tool=EBIhttps://doaj.org/toc/1553-734Xhttps://doaj.org/toc/1553-7358Advanced statistical methods used to analyze high-throughput data such as gene-expression assays result in long lists of "significant genes." One way to gain insight into the significance of altered expression levels is to determine whether Gene Ontology (GO) terms associated with a particular biological process, molecular function, or cellular component are over- or under-represented in the set of genes deemed significant. This process, referred to as enrichment analysis, profiles a gene-set, and is widely used to makes sense of the results of high-throughput experiments. The canonical example of enrichment analysis is when the output dataset is a list of genes differentially expressed in some condition. To determine the biological relevance of a lengthy gene list, the usual solution is to perform enrichment analysis with the GO. We can aggregate the annotating GO concepts for each gene in this list, and arrive at a profile of the biological processes or mechanisms affected by the condition under study. While GO has been the principal target for enrichment analysis, the methods of enrichment analysis are generalizable. We can conduct the same sort of profiling along other ontologies of interest. Just as scientists can ask "Which biological process is over-represented in my set of interesting genes or proteins?" we can also ask "Which disease (or class of diseases) is over-represented in my set of interesting genes or proteins?". For example, by annotating known protein mutations with disease terms from the ontologies in BioPortal, Mort et al. recently identified a class of diseases--blood coagulation disorders--that were associated with a 14-fold depletion in substitutions at O-linked glycosylation sites. With the availability of tools for automatic annotation of datasets with terms from disease ontologies, there is no reason to restrict enrichment analyses to the GO. In this chapter, we will discuss methods to perform enrichment analysis using any ontology available in the biomedical domain. We will review the general methodology of enrichment analysis, the associated challenges, and discuss the novel translational analyses enabled by the existence of public, national computational infrastructure and by the use of disease ontologies in such analyses.Nigam H ShahTyler ColeMark A MusenPublic Library of Science (PLoS)articleBiology (General)QH301-705.5ENPLoS Computational Biology, Vol 8, Iss 12, p e1002827 (2012)
institution DOAJ
collection DOAJ
language EN
topic Biology (General)
QH301-705.5
spellingShingle Biology (General)
QH301-705.5
Nigam H Shah
Tyler Cole
Mark A Musen
Chapter 9: Analyses using disease ontologies.
description Advanced statistical methods used to analyze high-throughput data such as gene-expression assays result in long lists of "significant genes." One way to gain insight into the significance of altered expression levels is to determine whether Gene Ontology (GO) terms associated with a particular biological process, molecular function, or cellular component are over- or under-represented in the set of genes deemed significant. This process, referred to as enrichment analysis, profiles a gene-set, and is widely used to makes sense of the results of high-throughput experiments. The canonical example of enrichment analysis is when the output dataset is a list of genes differentially expressed in some condition. To determine the biological relevance of a lengthy gene list, the usual solution is to perform enrichment analysis with the GO. We can aggregate the annotating GO concepts for each gene in this list, and arrive at a profile of the biological processes or mechanisms affected by the condition under study. While GO has been the principal target for enrichment analysis, the methods of enrichment analysis are generalizable. We can conduct the same sort of profiling along other ontologies of interest. Just as scientists can ask "Which biological process is over-represented in my set of interesting genes or proteins?" we can also ask "Which disease (or class of diseases) is over-represented in my set of interesting genes or proteins?". For example, by annotating known protein mutations with disease terms from the ontologies in BioPortal, Mort et al. recently identified a class of diseases--blood coagulation disorders--that were associated with a 14-fold depletion in substitutions at O-linked glycosylation sites. With the availability of tools for automatic annotation of datasets with terms from disease ontologies, there is no reason to restrict enrichment analyses to the GO. In this chapter, we will discuss methods to perform enrichment analysis using any ontology available in the biomedical domain. We will review the general methodology of enrichment analysis, the associated challenges, and discuss the novel translational analyses enabled by the existence of public, national computational infrastructure and by the use of disease ontologies in such analyses.
format article
author Nigam H Shah
Tyler Cole
Mark A Musen
author_facet Nigam H Shah
Tyler Cole
Mark A Musen
author_sort Nigam H Shah
title Chapter 9: Analyses using disease ontologies.
title_short Chapter 9: Analyses using disease ontologies.
title_full Chapter 9: Analyses using disease ontologies.
title_fullStr Chapter 9: Analyses using disease ontologies.
title_full_unstemmed Chapter 9: Analyses using disease ontologies.
title_sort chapter 9: analyses using disease ontologies.
publisher Public Library of Science (PLoS)
publishDate 2012
url https://doaj.org/article/b15b6283db404acc8279c088cbac6c2f
work_keys_str_mv AT nigamhshah chapter9analysesusingdiseaseontologies
AT tylercole chapter9analysesusingdiseaseontologies
AT markamusen chapter9analysesusingdiseaseontologies
_version_ 1718424749285048320