FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses

ABSTRACT Quantification tools for RNA sequencing (RNA-Seq) analyses are often designed and tested using human transcriptomics data sets, in which full-length transcript sequences are well annotated. For prokaryotic transcriptomics experiments, full-length transcript sequences are seldom known, and c...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Matthew Chung, Ricky S. Adkins, John S. A. Mattick, Katie R. Bradwell, Amol C. Shetty, Lisa Sadzewicz, Luke J. Tallon, Claire M. Fraser, David A. Rasko, Anup Mahurkar, Julie C. Dunning Hotopp
Formato: article
Lenguaje:EN
Publicado: American Society for Microbiology 2021
Materias:
Acceso en línea:https://doaj.org/article/b4053ee518834536913a290b2a6553e9
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:b4053ee518834536913a290b2a6553e9
record_format dspace
spelling oai:doaj.org-article:b4053ee518834536913a290b2a6553e92021-12-02T18:20:19ZFADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses10.1128/mSystems.00917-202379-5077https://doaj.org/article/b4053ee518834536913a290b2a6553e92021-02-01T00:00:00Zhttps://journals.asm.org/doi/10.1128/mSystems.00917-20https://doaj.org/toc/2379-5077ABSTRACT Quantification tools for RNA sequencing (RNA-Seq) analyses are often designed and tested using human transcriptomics data sets, in which full-length transcript sequences are well annotated. For prokaryotic transcriptomics experiments, full-length transcript sequences are seldom known, and coding sequences must instead be used for quantification steps in RNA-Seq analyses. However, operons confound accurate quantification of coding sequences since a single transcript does not necessarily equate to a single gene. Here, we introduce FADU (Feature Aggregate Depth Utility), a quantification tool designed specifically for prokaryotic RNA-Seq analyses. FADU assigns partial count values proportional to the length of the fragment overlapping the target feature. To assess the ability of FADU to quantify genes in prokaryotic transcriptomics analyses, we compared its performance to those of eXpress, featureCounts, HTSeq, kallisto, and Salmon across three paired-end read data sets of (i) Ehrlichia chaffeensis, (ii) Escherichia coli, and (iii) the Wolbachia endosymbiont wBm. Across each of the three data sets, we find that FADU can more accurately quantify operonic genes by deriving proportional counts for multigene fragments within operons. FADU is available at https://github.com/IGS/FADU. IMPORTANCE Most currently available quantification tools for transcriptomics analyses have been designed for human data sets, in which full-length transcript sequences, including the untranslated regions, are well annotated. In most prokaryotic systems, full-length transcript sequences have yet to be characterized, leading to prokaryotic transcriptomics analyses being performed based on only the coding sequences. In contrast to eukaryotes, prokaryotes contain polycistronic transcripts, and when genes are quantified based on coding sequences instead of transcript sequences, this leads to an increased abundance of improperly assigned ambiguous multigene fragments, specifically those mapping to multiple genes in operons. Here, we describe FADU, a quantification tool for prokaryotic RNA-Seq analyses designed to assign proportional counts with the purpose of better quantifying operonic genes while minimizing the pitfalls associated with improperly assigning fragment counts from ambiguous transcripts.Matthew ChungRicky S. AdkinsJohn S. A. MattickKatie R. BradwellAmol C. ShettyLisa SadzewiczLuke J. TallonClaire M. FraserDavid A. RaskoAnup MahurkarJulie C. Dunning HotoppAmerican Society for Microbiologyarticlebacteriadifferential expressionoperonpolycistronic transcriptsread countsoftwareMicrobiologyQR1-502ENmSystems, Vol 6, Iss 1 (2021)
institution DOAJ
collection DOAJ
language EN
topic bacteria
differential expression
operon
polycistronic transcripts
read count
software
Microbiology
QR1-502
spellingShingle bacteria
differential expression
operon
polycistronic transcripts
read count
software
Microbiology
QR1-502
Matthew Chung
Ricky S. Adkins
John S. A. Mattick
Katie R. Bradwell
Amol C. Shetty
Lisa Sadzewicz
Luke J. Tallon
Claire M. Fraser
David A. Rasko
Anup Mahurkar
Julie C. Dunning Hotopp
FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses
description ABSTRACT Quantification tools for RNA sequencing (RNA-Seq) analyses are often designed and tested using human transcriptomics data sets, in which full-length transcript sequences are well annotated. For prokaryotic transcriptomics experiments, full-length transcript sequences are seldom known, and coding sequences must instead be used for quantification steps in RNA-Seq analyses. However, operons confound accurate quantification of coding sequences since a single transcript does not necessarily equate to a single gene. Here, we introduce FADU (Feature Aggregate Depth Utility), a quantification tool designed specifically for prokaryotic RNA-Seq analyses. FADU assigns partial count values proportional to the length of the fragment overlapping the target feature. To assess the ability of FADU to quantify genes in prokaryotic transcriptomics analyses, we compared its performance to those of eXpress, featureCounts, HTSeq, kallisto, and Salmon across three paired-end read data sets of (i) Ehrlichia chaffeensis, (ii) Escherichia coli, and (iii) the Wolbachia endosymbiont wBm. Across each of the three data sets, we find that FADU can more accurately quantify operonic genes by deriving proportional counts for multigene fragments within operons. FADU is available at https://github.com/IGS/FADU. IMPORTANCE Most currently available quantification tools for transcriptomics analyses have been designed for human data sets, in which full-length transcript sequences, including the untranslated regions, are well annotated. In most prokaryotic systems, full-length transcript sequences have yet to be characterized, leading to prokaryotic transcriptomics analyses being performed based on only the coding sequences. In contrast to eukaryotes, prokaryotes contain polycistronic transcripts, and when genes are quantified based on coding sequences instead of transcript sequences, this leads to an increased abundance of improperly assigned ambiguous multigene fragments, specifically those mapping to multiple genes in operons. Here, we describe FADU, a quantification tool for prokaryotic RNA-Seq analyses designed to assign proportional counts with the purpose of better quantifying operonic genes while minimizing the pitfalls associated with improperly assigning fragment counts from ambiguous transcripts.
format article
author Matthew Chung
Ricky S. Adkins
John S. A. Mattick
Katie R. Bradwell
Amol C. Shetty
Lisa Sadzewicz
Luke J. Tallon
Claire M. Fraser
David A. Rasko
Anup Mahurkar
Julie C. Dunning Hotopp
author_facet Matthew Chung
Ricky S. Adkins
John S. A. Mattick
Katie R. Bradwell
Amol C. Shetty
Lisa Sadzewicz
Luke J. Tallon
Claire M. Fraser
David A. Rasko
Anup Mahurkar
Julie C. Dunning Hotopp
author_sort Matthew Chung
title FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses
title_short FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses
title_full FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses
title_fullStr FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses
title_full_unstemmed FADU: a Quantification Tool for Prokaryotic Transcriptomic Analyses
title_sort fadu: a quantification tool for prokaryotic transcriptomic analyses
publisher American Society for Microbiology
publishDate 2021
url https://doaj.org/article/b4053ee518834536913a290b2a6553e9
work_keys_str_mv AT matthewchung faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT rickysadkins faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT johnsamattick faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT katierbradwell faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT amolcshetty faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT lisasadzewicz faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT lukejtallon faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT clairemfraser faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT davidarasko faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT anupmahurkar faduaquantificationtoolforprokaryotictranscriptomicanalyses
AT juliecdunninghotopp faduaquantificationtoolforprokaryotictranscriptomicanalyses
_version_ 1718378177582071808