The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data

ABSTRACT PCR amplification of 16S rRNA genes is a critical yet underappreciated step in the generation of sequence data to describe the taxonomic composition of microbial communities. Numerous factors in the design of PCR can impact the sequencing error rate, the abundance of chimeric sequences, and...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Marc A. Sze, Patrick D. Schloss
Formato: article
Lenguaje:EN
Publicado: American Society for Microbiology 2019
Materias:
PCR
Acceso en línea:https://doaj.org/article/a7babaccad4f4b638fea61474f1b59d2
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:a7babaccad4f4b638fea61474f1b59d2
record_format dspace
spelling oai:doaj.org-article:a7babaccad4f4b638fea61474f1b59d22021-11-15T15:22:20ZThe Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data10.1128/mSphere.00163-192379-5042https://doaj.org/article/a7babaccad4f4b638fea61474f1b59d22019-06-01T00:00:00Zhttps://journals.asm.org/doi/10.1128/mSphere.00163-19https://doaj.org/toc/2379-5042ABSTRACT PCR amplification of 16S rRNA genes is a critical yet underappreciated step in the generation of sequence data to describe the taxonomic composition of microbial communities. Numerous factors in the design of PCR can impact the sequencing error rate, the abundance of chimeric sequences, and the degree to which the fragments in the product represent their abundance in the original sample (i.e., bias). We compared the performance of high fidelity polymerases and various numbers of rounds of amplification when amplifying a mock community and human stool samples. Although it was impossible to derive specific recommendations, we did observe general trends. Namely, using a polymerase with the highest possible fidelity and minimizing the number of rounds of PCR reduced the sequencing error rate, fraction of chimeric sequences, and bias. Evidence of bias at the sequence level was subtle and could not be ascribed to the fragments’ fraction of bases that were guanines or cytosines. When analyzing mock community data, the amount that the community deviated from the expected composition increased with the number of rounds of PCR. This bias was inconsistent for human stool samples. Overall, the results underscore the difficulty of comparing sequence data that are generated by different PCR protocols. However, the results indicate that the variation in human stool samples is generally larger than that introduced by the choice of polymerase or number of rounds of PCR. IMPORTANCE A steep decline in sequencing costs drove an explosion in studies characterizing microbial communities from diverse environments. Although a significant amount of effort has gone into understanding the error profiles of DNA sequencers, little has been done to understand the downstream effects of the PCR amplification protocol. We quantified the effects of the choice of polymerase and number of PCR cycles on the quality of downstream data. We found that these choices can have a profound impact on the way that a microbial community is represented in the sequence data. The effects are relatively small compared to the variation in human stool samples; however, care should be taken to use polymerases with the highest possible fidelity and to minimize the number of rounds of PCR. These results also underscore that it is not possible to directly compare sequence data generated under different PCR conditions.Marc A. SzePatrick D. SchlossAmerican Society for Microbiologyarticle16S rRNA genePCRbiasenvironmental microbiologymicrobial ecologymicrobiomeMicrobiologyQR1-502ENmSphere, Vol 4, Iss 3 (2019)
institution DOAJ
collection DOAJ
language EN
topic 16S rRNA gene
PCR
bias
environmental microbiology
microbial ecology
microbiome
Microbiology
QR1-502
spellingShingle 16S rRNA gene
PCR
bias
environmental microbiology
microbial ecology
microbiome
Microbiology
QR1-502
Marc A. Sze
Patrick D. Schloss
The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data
description ABSTRACT PCR amplification of 16S rRNA genes is a critical yet underappreciated step in the generation of sequence data to describe the taxonomic composition of microbial communities. Numerous factors in the design of PCR can impact the sequencing error rate, the abundance of chimeric sequences, and the degree to which the fragments in the product represent their abundance in the original sample (i.e., bias). We compared the performance of high fidelity polymerases and various numbers of rounds of amplification when amplifying a mock community and human stool samples. Although it was impossible to derive specific recommendations, we did observe general trends. Namely, using a polymerase with the highest possible fidelity and minimizing the number of rounds of PCR reduced the sequencing error rate, fraction of chimeric sequences, and bias. Evidence of bias at the sequence level was subtle and could not be ascribed to the fragments’ fraction of bases that were guanines or cytosines. When analyzing mock community data, the amount that the community deviated from the expected composition increased with the number of rounds of PCR. This bias was inconsistent for human stool samples. Overall, the results underscore the difficulty of comparing sequence data that are generated by different PCR protocols. However, the results indicate that the variation in human stool samples is generally larger than that introduced by the choice of polymerase or number of rounds of PCR. IMPORTANCE A steep decline in sequencing costs drove an explosion in studies characterizing microbial communities from diverse environments. Although a significant amount of effort has gone into understanding the error profiles of DNA sequencers, little has been done to understand the downstream effects of the PCR amplification protocol. We quantified the effects of the choice of polymerase and number of PCR cycles on the quality of downstream data. We found that these choices can have a profound impact on the way that a microbial community is represented in the sequence data. The effects are relatively small compared to the variation in human stool samples; however, care should be taken to use polymerases with the highest possible fidelity and to minimize the number of rounds of PCR. These results also underscore that it is not possible to directly compare sequence data generated under different PCR conditions.
format article
author Marc A. Sze
Patrick D. Schloss
author_facet Marc A. Sze
Patrick D. Schloss
author_sort Marc A. Sze
title The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data
title_short The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data
title_full The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data
title_fullStr The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data
title_full_unstemmed The Impact of DNA Polymerase and Number of Rounds of Amplification in PCR on 16S rRNA Gene Sequence Data
title_sort impact of dna polymerase and number of rounds of amplification in pcr on 16s rrna gene sequence data
publisher American Society for Microbiology
publishDate 2019
url https://doaj.org/article/a7babaccad4f4b638fea61474f1b59d2
work_keys_str_mv AT marcasze theimpactofdnapolymeraseandnumberofroundsofamplificationinpcron16srrnagenesequencedata
AT patrickdschloss theimpactofdnapolymeraseandnumberofroundsofamplificationinpcron16srrnagenesequencedata
AT marcasze impactofdnapolymeraseandnumberofroundsofamplificationinpcron16srrnagenesequencedata
AT patrickdschloss impactofdnapolymeraseandnumberofroundsofamplificationinpcron16srrnagenesequencedata
_version_ 1718428023425859584