A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.
The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated duri...
Guardado en:
Autores principales: | , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Public Library of Science (PLoS)
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/25eefab5715547d3ad7ca2a44510f15a |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:25eefab5715547d3ad7ca2a44510f15a |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:25eefab5715547d3ad7ca2a44510f15a2021-12-02T19:57:50ZA Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes.1553-734X1553-735810.1371/journal.pcbi.1009323https://doaj.org/article/25eefab5715547d3ad7ca2a44510f15a2021-09-01T00:00:00Zhttps://doi.org/10.1371/journal.pcbi.1009323https://doaj.org/toc/1553-734Xhttps://doaj.org/toc/1553-7358The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated during transcription. Attempts have been made to correlate SHM with ssDNA using bisulfite to chemically convert cytosines that are accessible in the intact chromatin of mutating B cells. These studies have been complicated by using different definitions of "bisulfite accessible regions" (BARs). Recently, deep-sequencing has provided much larger datasets of such regions but computational methods are needed to enable this analysis. Here we leveraged the deep-sequencing approach with unique molecular identifiers and developed a novel Hidden Markov Model based Bayesian Segmentation algorithm to characterize the ssDNA regions in the IGHV4-34 gene of the human Ramos B cell line. Combining hierarchical clustering and our new Bayesian model, we identified recurrent BARs in certain subregions of both top and bottom strands of this gene. Using this new system, the average size of BARs is about 15 bp. We also identified potential G-quadruplex DNA structures in this gene and found that the BARs co-locate with G-quadruplex structures in the opposite strand. Using various correlation analyses, there is not a direct site-to-site relationship between the bisulfite accessible ssDNA and all sites of SHM but most of the highly AID mutated sites are within 15 bp of a BAR. In summary, we developed a novel platform to study single stranded DNA in chromatin at a base pair resolution that reveals potential relationships among BARs, SHM and G-quadruplexes. This platform could be applied to genome wide studies in the future.Guojun YuYingru WuZhi DuanCatherine TangHaipeng XingMatthew D ScharffThomas MacCarthyPublic Library of Science (PLoS)articleBiology (General)QH301-705.5ENPLoS Computational Biology, Vol 17, Iss 9, p e1009323 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Biology (General) QH301-705.5 |
spellingShingle |
Biology (General) QH301-705.5 Guojun Yu Yingru Wu Zhi Duan Catherine Tang Haipeng Xing Matthew D Scharff Thomas MacCarthy A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. |
description |
The B cells in our body generate protective antibodies by introducing somatic hypermutations (SHM) into the variable region of immunoglobulin genes (IgVs). The mutations are generated by activation induced deaminase (AID) that converts cytosine to uracil in single stranded DNA (ssDNA) generated during transcription. Attempts have been made to correlate SHM with ssDNA using bisulfite to chemically convert cytosines that are accessible in the intact chromatin of mutating B cells. These studies have been complicated by using different definitions of "bisulfite accessible regions" (BARs). Recently, deep-sequencing has provided much larger datasets of such regions but computational methods are needed to enable this analysis. Here we leveraged the deep-sequencing approach with unique molecular identifiers and developed a novel Hidden Markov Model based Bayesian Segmentation algorithm to characterize the ssDNA regions in the IGHV4-34 gene of the human Ramos B cell line. Combining hierarchical clustering and our new Bayesian model, we identified recurrent BARs in certain subregions of both top and bottom strands of this gene. Using this new system, the average size of BARs is about 15 bp. We also identified potential G-quadruplex DNA structures in this gene and found that the BARs co-locate with G-quadruplex structures in the opposite strand. Using various correlation analyses, there is not a direct site-to-site relationship between the bisulfite accessible ssDNA and all sites of SHM but most of the highly AID mutated sites are within 15 bp of a BAR. In summary, we developed a novel platform to study single stranded DNA in chromatin at a base pair resolution that reveals potential relationships among BARs, SHM and G-quadruplexes. This platform could be applied to genome wide studies in the future. |
format |
article |
author |
Guojun Yu Yingru Wu Zhi Duan Catherine Tang Haipeng Xing Matthew D Scharff Thomas MacCarthy |
author_facet |
Guojun Yu Yingru Wu Zhi Duan Catherine Tang Haipeng Xing Matthew D Scharff Thomas MacCarthy |
author_sort |
Guojun Yu |
title |
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. |
title_short |
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. |
title_full |
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. |
title_fullStr |
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. |
title_full_unstemmed |
A Bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded DNA in chromatin and somatic hypermutation of immunoglobulin genes. |
title_sort |
bayesian model based computational analysis of the relationship between bisulfite accessible single-stranded dna in chromatin and somatic hypermutation of immunoglobulin genes. |
publisher |
Public Library of Science (PLoS) |
publishDate |
2021 |
url |
https://doaj.org/article/25eefab5715547d3ad7ca2a44510f15a |
work_keys_str_mv |
AT guojunyu abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT yingruwu abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT zhiduan abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT catherinetang abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT haipengxing abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT matthewdscharff abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT thomasmaccarthy abayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT guojunyu bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT yingruwu bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT zhiduan bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT catherinetang bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT haipengxing bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT matthewdscharff bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes AT thomasmaccarthy bayesianmodelbasedcomputationalanalysisoftherelationshipbetweenbisulfiteaccessiblesinglestrandeddnainchromatinandsomatichypermutationofimmunoglobulingenes |
_version_ |
1718375768044601344 |