A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.

The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated duri...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Guojun Yu, Yingru Wu, Zhi Duan, Catherine Tang, Haipeng Xing, Matthew D Scharff, Thomas MacCarthy
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
Acceso en línea:https://doaj.org/article/25eefab5715547d3ad7ca2a44510f15a
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:25eefab5715547d3ad7ca2a44510f15a
record_format dspace
spelling oai:doaj.org-article:25eefab5715547d3ad7ca2a44510f15a2021-12-02T19:57:50ZA Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.1553-734X1553-735810.1371/journal.pcbi.1009323https://doaj.org/article/25eefab5715547d3ad7ca2a44510f15a2021-09-01T00:00:00Zhttps://doi.org/10.1371/journal.pcbi.1009323https://doaj.org/toc/1553-734Xhttps://doaj.org/toc/1553-7358The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated during transcription. Attempts have been made to correlate SHM with ssDNA using bisulfite to chemically convert cytosines that are accessible in the intact chromatin of mutating B cells. These studies have been complicated by using different definitions of "bisulfite accessible regions" (BARs). Recently, deep-sequencing has provided much larger datasets of such regions but computational methods are needed to enable this analysis. Here we leveraged the deep-sequencing approach with unique molecular identifiers and developed a novel Hidden Markov Model based Bayesian Segmentation algorithm to characterize the ssDNA regions in the IGHV4-34 gene of the human Ramos B cell line. Combining hierarchical clustering and our new Bayesian model, we identified recurrent BARs in certain subregions of both top and bottom strands of this gene. Using this new system, the average size of BARs is about 15 bp. We also identified potential G-quadruplex DNA structures in this gene and found that the BARs co-locate with G-quadruplex structures in the opposite strand. Using various correlation analyses, there is not a direct site-to-site relationship between the bisulfite accessible ssDNA and all sites of SHM but most of the highly AID mutated sites are within 15 bp of a BAR. In summary, we developed a novel platform to study single stranded DNA in chromatin at a base pair resolution that reveals potential relationships among BARs, SHM and G-quadruplexes. This platform could be applied to genome wide studies in the future.Guojun YuYingru WuZhi DuanCatherine TangHaipeng XingMatthew D ScharffThomas MacCarthyPublic Library of Science (PLoS)articleBiology (General)QH301-705.5ENPLoS Computational Biology, Vol 17, Iss 9, p e1009323 (2021)
institution DOAJ
collection DOAJ
language EN
topic Biology (General)
QH301-705.5
spellingShingle Biology (General)
QH301-705.5
Guojun Yu
Yingru Wu
Zhi Duan
Catherine Tang
Haipeng Xing
Matthew D Scharff
Thomas MacCarthy
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
description The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated during transcription. Attempts have been made to correlate SHM with ssDNA using bisulfite to chemically convert cytosines that are accessible in the intact chromatin of mutating B cells. These studies have been complicated by using different definitions of "bisulfite accessible regions" (BARs). Recently, deep-sequencing has provided much larger datasets of such regions but computational methods are needed to enable this analysis. Here we leveraged the deep-sequencing approach with unique molecular identifiers and developed a novel Hidden Markov Model based Bayesian Segmentation algorithm to characterize the ssDNA regions in the IGHV4-34 gene of the human Ramos B cell line. Combining hierarchical clustering and our new Bayesian model, we identified recurrent BARs in certain subregions of both top and bottom strands of this gene. Using this new system, the average size of BARs is about 15 bp. We also identified potential G-quadruplex DNA structures in this gene and found that the BARs co-locate with G-quadruplex structures in the opposite strand. Using various correlation analyses, there is not a direct site-to-site relationship between the bisulfite accessible ssDNA and all sites of SHM but most of the highly AID mutated sites are within 15 bp of a BAR. In summary, we developed a novel platform to study single stranded DNA in chromatin at a base pair resolution that reveals potential relationships among BARs, SHM and G-quadruplexes. This platform could be applied to genome wide studies in the future.
format article
author Guojun Yu
Yingru Wu
Zhi Duan
Catherine Tang
Haipeng Xing
Matthew D Scharff
Thomas MacCarthy
author_facet Guojun Yu
Yingru Wu
Zhi Duan
Catherine Tang
Haipeng Xing
Matthew D Scharff
Thomas MacCarthy
author_sort Guojun Yu
title A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
title_short A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
title_full A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
title_fullStr A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
title_full_unstemmed A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
title_sort bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded dna in chromatin and somatic hypermutation of immunoglobulin genes.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/25eefab5715547d3ad7ca2a44510f15a
work_keys_str_mv AT guojunyu abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT yingruwu abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT zhiduan abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT catherinetang abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT haipengxing abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT matthewdscharff abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT thomasmaccarthy abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT guojunyu bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT yingruwu bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT zhiduan bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT catherinetang bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT haipengxing bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT matthewdscharff bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
AT thomasmaccarthy bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes
_version_ 1718375768044601344