High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.

The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The med...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Wesley M Marin, Ravi Dandekar, Danillo G Augusto, Tasneem Yusufali, Bianca Heyn, Jan Hofmann, Vinzenz Lange, Jürgen Sauter, Paul J Norman, Jill A Hollenbach
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2021
Materias:
Acceso en línea:https://doaj.org/article/4880db1e9f7e492dafc5a56310cfa057
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:4880db1e9f7e492dafc5a56310cfa057
record_format dspace
spelling oai:doaj.org-article:4880db1e9f7e492dafc5a56310cfa0572021-12-02T19:58:09ZHigh-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.1553-734X1553-735810.1371/journal.pcbi.1008904https://doaj.org/article/4880db1e9f7e492dafc5a56310cfa0572021-08-01T00:00:00Zhttps://doi.org/10.1371/journal.pcbi.1008904https://doaj.org/toc/1553-734Xhttps://doaj.org/toc/1553-7358The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies.Wesley M MarinRavi DandekarDanillo G AugustoTasneem YusufaliBianca HeynJan HofmannVinzenz LangeJürgen SauterPaul J NormanJill A HollenbachPublic Library of Science (PLoS)articleBiology (General)QH301-705.5ENPLoS Computational Biology, Vol 17, Iss 8, p e1008904 (2021)
institution DOAJ
collection DOAJ
language EN
topic Biology (General)
QH301-705.5
spellingShingle Biology (General)
QH301-705.5
Wesley M Marin
Ravi Dandekar
Danillo G Augusto
Tasneem Yusufali
Bianca Heyn
Jan Hofmann
Vinzenz Lange
Jürgen Sauter
Paul J Norman
Jill A Hollenbach
High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.
description The killer-cell immunoglobulin-like receptor (KIR) complex on chromosome 19 encodes receptors that modulate the activity of natural killer cells, and variation in these genes has been linked to infectious and autoimmune disease, as well as having bearing on pregnancy and transplant outcomes. The medical relevance and high variability of KIR genes makes short-read sequencing an attractive technology for interrogating the region, providing a high-throughput, high-fidelity sequencing method that is cost-effective. However, because this gene complex is characterized by extensive nucleotide polymorphism, structural variation including gene fusions and deletions, and a high level of homology between genes, its interrogation at high resolution has been thwarted by bioinformatic challenges, with most studies limited to examining presence or absence of specific genes. Here, we present the PING (Pushing Immunogenetics to the Next Generation) pipeline, which incorporates empirical data, novel alignment strategies and a custom alignment processing workflow to enable high-throughput KIR sequence analysis from short-read data. PING provides KIR gene copy number classification functionality for all KIR genes through use of a comprehensive alignment reference. The gene copy number determined per individual enables an innovative genotype determination workflow using genotype-matched references. Together, these methods address the challenges imposed by the structural complexity and overall homology of the KIR complex. To determine copy number and genotype determination accuracy, we applied PING to European and African validation cohorts and a synthetic dataset. PING demonstrated exceptional copy number determination performance across all datasets and robust genotype determination performance. Finally, an investigation into discordant genotypes for the synthetic dataset provides insight into misaligned reads, advancing our understanding in interpretation of short-read sequencing data in complex genomic regions. PING promises to support a new era of studies of KIR polymorphism, delivering high-resolution KIR genotypes that are highly accurate, enabling high-quality, high-throughput KIR genotyping for disease and population studies.
format article
author Wesley M Marin
Ravi Dandekar
Danillo G Augusto
Tasneem Yusufali
Bianca Heyn
Jan Hofmann
Vinzenz Lange
Jürgen Sauter
Paul J Norman
Jill A Hollenbach
author_facet Wesley M Marin
Ravi Dandekar
Danillo G Augusto
Tasneem Yusufali
Bianca Heyn
Jan Hofmann
Vinzenz Lange
Jürgen Sauter
Paul J Norman
Jill A Hollenbach
author_sort Wesley M Marin
title High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.
title_short High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.
title_full High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.
title_fullStr High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.
title_full_unstemmed High-throughput Interpretation of Killer-cell Immunoglobulin-like Receptor Short-read Sequencing Data with PING.
title_sort high-throughput interpretation of killer-cell immunoglobulin-like receptor short-read sequencing data with ping.
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/4880db1e9f7e492dafc5a56310cfa057
work_keys_str_mv AT wesleymmarin highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT ravidandekar highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT danillogaugusto highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT tasneemyusufali highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT biancaheyn highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT janhofmann highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT vinzenzlange highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT jurgensauter highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT pauljnorman highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
AT jillahollenbach highthroughputinterpretationofkillercellimmunoglobulinlikereceptorshortreadsequencingdatawithping
_version_ 1718375775610077184