Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains

ABSTRACT Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Jenny L. Draper, Lori M. Hansen, David L. Bernick, Samar Abedrabbo, Jason G. Underwood, Nguyet Kong, Bihua C. Huang, Allison M. Weis, Bart C. Weimer, Arnoud H. M. van Vliet, Nader Pourmand, Jay V. Solnick, Kevin Karplus, Karen M. Ottemann
Formato: article
Lenguaje:EN
Publicado: American Society for Microbiology 2017
Materias:
Acceso en línea:https://doaj.org/article/01a452ee2eb94a0f9d78d0252bbc821b
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:01a452ee2eb94a0f9d78d0252bbc821b
record_format dspace
spelling oai:doaj.org-article:01a452ee2eb94a0f9d78d0252bbc821b2021-11-15T15:51:07ZFallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains10.1128/mBio.02321-162150-7511https://doaj.org/article/01a452ee2eb94a0f9d78d0252bbc821b2017-03-01T00:00:00Zhttps://journals.asm.org/doi/10.1128/mBio.02321-16https://doaj.org/toc/2150-7511ABSTRACT Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. IMPORTANCE Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish “the genome” of a bacterial strain. Variability is usually reduced (“only sequence from a single colony”), ignored (“just publish the consensus”), or placed in the “too-hard” basket (“analysis of raw read data is more robust”). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen Helicobacter pylori model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as “the genome” of a bacterial strain may be misleading.Jenny L. DraperLori M. HansenDavid L. BernickSamar AbedrabboJason G. UnderwoodNguyet KongBihua C. HuangAllison M. WeisBart C. WeimerArnoud H. M. van VlietNader PourmandJay V. SolnickKevin KarplusKaren M. OttemannAmerican Society for MicrobiologyarticleMicrobiologyQR1-502ENmBio, Vol 8, Iss 1 (2017)
institution DOAJ
collection DOAJ
language EN
topic Microbiology
QR1-502
spellingShingle Microbiology
QR1-502
Jenny L. Draper
Lori M. Hansen
David L. Bernick
Samar Abedrabbo
Jason G. Underwood
Nguyet Kong
Bihua C. Huang
Allison M. Weis
Bart C. Weimer
Arnoud H. M. van Vliet
Nader Pourmand
Jay V. Solnick
Kevin Karplus
Karen M. Ottemann
Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains
description ABSTRACT Many bacterial genomes are highly variable but nonetheless are typically published as a single assembled genome. Experiments tracking bacterial genome evolution have not looked at the variation present at a given point in time. Here, we analyzed the mouse-passaged Helicobacter pylori strain SS1 and its parent PMSS1 to assess intra- and intergenomic variability. Using high sequence coverage depth and experimental validation, we detected extensive genome plasticity within these H. pylori isolates, including movement of the transposable element IS607, large and small inversions, multiple single nucleotide polymorphisms, and variation in cagA copy number. The cagA gene was found as 1 to 4 tandem copies located off the cag island in both SS1 and PMSS1; this copy number variation correlated with protein expression. To gain insight into the changes that occurred during mouse adaptation, we also compared SS1 and PMSS1 and observed 46 differences that were distinct from the within-genome variation. The most substantial was an insertion in cagY, which encodes a protein required for a type IV secretion system function. We detected modifications in genes coding for two proteins known to affect mouse colonization, the HpaA neuraminyllactose-binding protein and the FutB α-1,3 lipopolysaccharide (LPS) fucosyltransferase, as well as genes predicted to modulate diverse properties. In sum, our work suggests that data from consensus genome assemblies from single colonies may be misleading by failing to represent the variability present. Furthermore, we show that high-depth genomic sequencing data of a population can be analyzed to gain insight into the normal variation within bacterial strains. IMPORTANCE Although it is well known that many bacterial genomes are highly variable, it is nonetheless traditional to refer to, analyze, and publish “the genome” of a bacterial strain. Variability is usually reduced (“only sequence from a single colony”), ignored (“just publish the consensus”), or placed in the “too-hard” basket (“analysis of raw read data is more robust”). Now that whole-genome sequences are regularly used to assess virulence and track outbreaks, a better understanding of the baseline genomic variation present within single strains is needed. Here, we describe the variability seen in typical working stocks and colonies of pathogen Helicobacter pylori model strains SS1 and PMSS1 as revealed by use of high-coverage mate pair next-generation sequencing (NGS) and confirmed by traditional laboratory techniques. This work demonstrates that reliance on a consensus assembly as “the genome” of a bacterial strain may be misleading.
format article
author Jenny L. Draper
Lori M. Hansen
David L. Bernick
Samar Abedrabbo
Jason G. Underwood
Nguyet Kong
Bihua C. Huang
Allison M. Weis
Bart C. Weimer
Arnoud H. M. van Vliet
Nader Pourmand
Jay V. Solnick
Kevin Karplus
Karen M. Ottemann
author_facet Jenny L. Draper
Lori M. Hansen
David L. Bernick
Samar Abedrabbo
Jason G. Underwood
Nguyet Kong
Bihua C. Huang
Allison M. Weis
Bart C. Weimer
Arnoud H. M. van Vliet
Nader Pourmand
Jay V. Solnick
Kevin Karplus
Karen M. Ottemann
author_sort Jenny L. Draper
title Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains
title_short Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains
title_full Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains
title_fullStr Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains
title_full_unstemmed Fallacy of the Unique Genome: Sequence Diversity within Single <italic toggle="yes">Helicobacter pylori</italic> Strains
title_sort fallacy of the unique genome: sequence diversity within single <italic toggle="yes">helicobacter pylori</italic> strains
publisher American Society for Microbiology
publishDate 2017
url https://doaj.org/article/01a452ee2eb94a0f9d78d0252bbc821b
work_keys_str_mv AT jennyldraper fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT lorimhansen fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT davidlbernick fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT samarabedrabbo fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT jasongunderwood fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT nguyetkong fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT bihuachuang fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT allisonmweis fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT bartcweimer fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT arnoudhmvanvliet fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT naderpourmand fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT jayvsolnick fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT kevinkarplus fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
AT karenmottemann fallacyoftheuniquegenomesequencediversitywithinsingleitalictoggleyeshelicobacterpyloriitalicstrains
_version_ 1718427377495703552