Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.

Analysis of microbial communities by high-throughput pyrosequencing of SSU rRNA gene PCR amplicons has transformed microbial ecology research and led to the observation that many communities contain a diverse assortment of rare taxa-a phenomenon termed the Rare Biosphere. Multiple studies have inves...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Charles K Lee, Craig W Herbold, Shawn W Polson, K Eric Wommack, Shannon J Williamson, Ian R McDonald, S Craig Cary
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2012
Materias:
R
Q
Acceso en línea:https://doaj.org/article/4cd79a666d4a406e823c45e048c8e514
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:4cd79a666d4a406e823c45e048c8e514
record_format dspace
spelling oai:doaj.org-article:4cd79a666d4a406e823c45e048c8e5142021-11-18T07:06:25ZGroundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.1932-620310.1371/journal.pone.0044224https://doaj.org/article/4cd79a666d4a406e823c45e048c8e5142012-01-01T00:00:00Zhttps://www.ncbi.nlm.nih.gov/pmc/articles/pmid/22970184/?tool=EBIhttps://doaj.org/toc/1932-6203Analysis of microbial communities by high-throughput pyrosequencing of SSU rRNA gene PCR amplicons has transformed microbial ecology research and led to the observation that many communities contain a diverse assortment of rare taxa-a phenomenon termed the Rare Biosphere. Multiple studies have investigated the effect of pyrosequencing read quality on operational taxonomic unit (OTU) richness for contrived communities, yet there is limited information on the fidelity of community structure estimates obtained through this approach. Given that PCR biases are widely recognized, and further unknown biases may arise from the sequencing process itself, a priori assumptions about the neutrality of the data generation process are at best unvalidated. Furthermore, post-sequencing quality control algorithms have not been explicitly evaluated for the accuracy of recovered representative sequences and its impact on downstream analyses, reducing useful discussion on pyrosequencing reads to their diversity and abundances. Here we report on community structures and sequences recovered for in vitro-simulated communities consisting of twenty 16S rRNA gene clones tiered at known proportions. PCR amplicon libraries of the V3-V4 and V6 hypervariable regions from the in vitro-simulated communities were sequenced using the Roche 454 GS FLX Titanium platform. Commonly used quality control protocols resulted in the formation of OTUs with >1% abundance composed entirely of erroneous sequences, while over-aggressive clustering approaches obfuscated real, expected OTUs. The pyrosequencing process itself did not appear to impose significant biases on overall community structure estimates, although the detection limit for rare taxa may be affected by PCR amplicon size and quality control approach employed. Meanwhile, PCR biases associated with the initial amplicon generation may impose greater distortions in the observed community structure.Charles K LeeCraig W HerboldShawn W PolsonK Eric WommackShannon J WilliamsonIan R McDonaldS Craig CaryPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 7, Iss 9, p e44224 (2012)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Charles K Lee
Craig W Herbold
Shawn W Polson
K Eric Wommack
Shannon J Williamson
Ian R McDonald
S Craig Cary
Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.
description Analysis of microbial communities by high-throughput pyrosequencing of SSU rRNA gene PCR amplicons has transformed microbial ecology research and led to the observation that many communities contain a diverse assortment of rare taxa-a phenomenon termed the Rare Biosphere. Multiple studies have investigated the effect of pyrosequencing read quality on operational taxonomic unit (OTU) richness for contrived communities, yet there is limited information on the fidelity of community structure estimates obtained through this approach. Given that PCR biases are widely recognized, and further unknown biases may arise from the sequencing process itself, a priori assumptions about the neutrality of the data generation process are at best unvalidated. Furthermore, post-sequencing quality control algorithms have not been explicitly evaluated for the accuracy of recovered representative sequences and its impact on downstream analyses, reducing useful discussion on pyrosequencing reads to their diversity and abundances. Here we report on community structures and sequences recovered for in vitro-simulated communities consisting of twenty 16S rRNA gene clones tiered at known proportions. PCR amplicon libraries of the V3-V4 and V6 hypervariable regions from the in vitro-simulated communities were sequenced using the Roche 454 GS FLX Titanium platform. Commonly used quality control protocols resulted in the formation of OTUs with >1% abundance composed entirely of erroneous sequences, while over-aggressive clustering approaches obfuscated real, expected OTUs. The pyrosequencing process itself did not appear to impose significant biases on overall community structure estimates, although the detection limit for rare taxa may be affected by PCR amplicon size and quality control approach employed. Meanwhile, PCR biases associated with the initial amplicon generation may impose greater distortions in the observed community structure.
format article
author Charles K Lee
Craig W Herbold
Shawn W Polson
K Eric Wommack
Shannon J Williamson
Ian R McDonald
S Craig Cary
author_facet Charles K Lee
Craig W Herbold
Shawn W Polson
K Eric Wommack
Shannon J Williamson
Ian R McDonald
S Craig Cary
author_sort Charles K Lee
title Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.
title_short Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.
title_full Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.
title_fullStr Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.
title_full_unstemmed Groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from PCR amplicon pyrosequencing.
title_sort groundtruthing next-gen sequencing for microbial ecology-biases and errors in community structure estimates from pcr amplicon pyrosequencing.
publisher Public Library of Science (PLoS)
publishDate 2012
url https://doaj.org/article/4cd79a666d4a406e823c45e048c8e514
work_keys_str_mv AT charlesklee groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
AT craigwherbold groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
AT shawnwpolson groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
AT kericwommack groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
AT shannonjwilliamson groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
AT ianrmcdonald groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
AT scraigcary groundtruthingnextgensequencingformicrobialecologybiasesanderrorsincommunitystructureestimatesfrompcrampliconpyrosequencing
_version_ 1718423913297346560