deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.

Gene fusions created by somatic genomic rearrangements are known to play an important role in the onset and development of some cancers, such as lymphomas and sarcomas. RNA-Seq (whole transcriptome shotgun sequencing) is proving to be a useful tool for the discovery of novel gene fusions in cancer t...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Andrew McPherson, Fereydoun Hormozdiari, Abdalnasser Zayed, Ryan Giuliany, Gavin Ha, Mark G F Sun, Malachi Griffith, Alireza Heravi Moussavi, Janine Senz, Nataliya Melnyk, Marina Pacheco, Marco A Marra, Martin Hirst, Torsten O Nielsen, S Cenk Sahinalp, David Huntsman, Sohrab P Shah
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2011
Materias:
Acceso en línea:https://doaj.org/article/c9b389b739544cc28b879c2f050782a1
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:c9b389b739544cc28b879c2f050782a1
record_format dspace
spelling oai:doaj.org-article:c9b389b739544cc28b879c2f050782a12021-11-18T05:50:32ZdeFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.1553-734X1553-735810.1371/journal.pcbi.1001138https://doaj.org/article/c9b389b739544cc28b879c2f050782a12011-05-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/21625565/?tool=EBIhttps://doaj.org/toc/1553-734Xhttps://doaj.org/toc/1553-7358Gene fusions created by somatic genomic rearrangements are known to play an important role in the onset and development of some cancers, such as lymphomas and sarcomas. RNA-Seq (whole transcriptome shotgun sequencing) is proving to be a useful tool for the discovery of novel gene fusions in cancer transcriptomes. However, algorithmic methods for the discovery of gene fusions using RNA-Seq data remain underdeveloped. We have developed deFuse, a novel computational method for fusion discovery in tumor RNA-Seq data. Unlike existing methods that use only unique best-hit alignments and consider only fusion boundaries at the ends of known exons, deFuse considers all alignments and all possible locations for fusion boundaries. As a result, deFuse is able to identify fusion sequences with demonstrably better sensitivity than previous approaches. To increase the specificity of our approach, we curated a list of 60 true positive and 61 true negative fusion sequences (as confirmed by RT-PCR), and have trained an adaboost classifier on 11 novel features of the sequence data. The resulting classifier has an estimated value of 0.91 for the area under the ROC curve. We have used deFuse to discover gene fusions in 40 ovarian tumor samples, one ovarian cancer cell line, and three sarcoma samples. We report herein the first gene fusions discovered in ovarian cancer. We conclude that gene fusions are not infrequent events in ovarian cancer and that these events have the potential to substantially alter the expression patterns of the genes involved; gene fusions should therefore be considered in efforts to comprehensively characterize the mutational profiles of ovarian cancer transcriptomes.Andrew McPhersonFereydoun HormozdiariAbdalnasser ZayedRyan GiulianyGavin HaMark G F SunMalachi GriffithAlireza Heravi MoussaviJanine SenzNataliya MelnykMarina PachecoMarco A MarraMartin HirstTorsten O NielsenS Cenk SahinalpDavid HuntsmanSohrab P ShahPublic Library of Science (PLoS)articleBiology (General)QH301-705.5ENPLoS Computational Biology, Vol 7, Iss 5, p e1001138 (2011)
institution DOAJ
collection DOAJ
language EN
topic Biology (General)
QH301-705.5
spellingShingle Biology (General)
QH301-705.5
Andrew McPherson
Fereydoun Hormozdiari
Abdalnasser Zayed
Ryan Giuliany
Gavin Ha
Mark G F Sun
Malachi Griffith
Alireza Heravi Moussavi
Janine Senz
Nataliya Melnyk
Marina Pacheco
Marco A Marra
Martin Hirst
Torsten O Nielsen
S Cenk Sahinalp
David Huntsman
Sohrab P Shah
deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.
description Gene fusions created by somatic genomic rearrangements are known to play an important role in the onset and development of some cancers, such as lymphomas and sarcomas. RNA-Seq (whole transcriptome shotgun sequencing) is proving to be a useful tool for the discovery of novel gene fusions in cancer transcriptomes. However, algorithmic methods for the discovery of gene fusions using RNA-Seq data remain underdeveloped. We have developed deFuse, a novel computational method for fusion discovery in tumor RNA-Seq data. Unlike existing methods that use only unique best-hit alignments and consider only fusion boundaries at the ends of known exons, deFuse considers all alignments and all possible locations for fusion boundaries. As a result, deFuse is able to identify fusion sequences with demonstrably better sensitivity than previous approaches. To increase the specificity of our approach, we curated a list of 60 true positive and 61 true negative fusion sequences (as confirmed by RT-PCR), and have trained an adaboost classifier on 11 novel features of the sequence data. The resulting classifier has an estimated value of 0.91 for the area under the ROC curve. We have used deFuse to discover gene fusions in 40 ovarian tumor samples, one ovarian cancer cell line, and three sarcoma samples. We report herein the first gene fusions discovered in ovarian cancer. We conclude that gene fusions are not infrequent events in ovarian cancer and that these events have the potential to substantially alter the expression patterns of the genes involved; gene fusions should therefore be considered in efforts to comprehensively characterize the mutational profiles of ovarian cancer transcriptomes.
format article
author Andrew McPherson
Fereydoun Hormozdiari
Abdalnasser Zayed
Ryan Giuliany
Gavin Ha
Mark G F Sun
Malachi Griffith
Alireza Heravi Moussavi
Janine Senz
Nataliya Melnyk
Marina Pacheco
Marco A Marra
Martin Hirst
Torsten O Nielsen
S Cenk Sahinalp
David Huntsman
Sohrab P Shah
author_facet Andrew McPherson
Fereydoun Hormozdiari
Abdalnasser Zayed
Ryan Giuliany
Gavin Ha
Mark G F Sun
Malachi Griffith
Alireza Heravi Moussavi
Janine Senz
Nataliya Melnyk
Marina Pacheco
Marco A Marra
Martin Hirst
Torsten O Nielsen
S Cenk Sahinalp
David Huntsman
Sohrab P Shah
author_sort Andrew McPherson
title deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.
title_short deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.
title_full deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.
title_fullStr deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.
title_full_unstemmed deFuse: an algorithm for gene fusion discovery in tumor RNA-Seq data.
title_sort defuse: an algorithm for gene fusion discovery in tumor rna-seq data.
publisher Public Library of Science (PLoS)
publishDate 2011
url https://doaj.org/article/c9b389b739544cc28b879c2f050782a1
work_keys_str_mv AT andrewmcpherson defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT fereydounhormozdiari defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT abdalnasserzayed defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT ryangiuliany defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT gavinha defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT markgfsun defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT malachigriffith defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT alirezaheravimoussavi defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT janinesenz defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT nataliyamelnyk defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT marinapacheco defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT marcoamarra defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT martinhirst defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT torstenonielsen defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT scenksahinalp defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT davidhuntsman defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
AT sohrabpshah defuseanalgorithmforgenefusiondiscoveryintumorrnaseqdata
_version_ 1718424815608528896