CAMAP: Artificial neural networks unveil the role of codon arrangement in modulating MHC-I peptides presentation

MHC-I associated peptides (MAPs) play a central role in the elimination of virus-infected and neoplastic cells by CD8 T cells. However, accurately predicting the MAP repertoire remains difficult, because only a fraction of the transcriptome generates MAPs. In this study, we investigated whether codo...

Bibliographic Details
Main authors: Tariq Daouda, Maude Dumont-Lagacé, Albert Feghaly, Yahya Benslimane, Rébecca Panes, Mathieu Courcelles, Mohamed Benhammadi, Lea Harrington, Pierre Thibault, François Major, Yoshua Bengio, Étienne Gagnon, Sébastien Lemieux, Claude Perreault
Format: article
Language: EN
Published: Public Library of Science (PLoS) 2021
Subjects: Biology (General)
Online access: https://doaj.org/article/cbd385c9465f4eb1bec7127d68136b08
id oai:doaj.org-article:cbd385c9465f4eb1bec7127d68136b08
record_format dspace
spelling oai:doaj.org-article:cbd385c9465f4eb1bec7127d68136b08
2021-11-18T05:49:15Z
CAMAP: Artificial neural networks unveil the role of codon arrangement in modulating MHC-I peptides presentation
1553-734X
1553-7358
https://doaj.org/article/cbd385c9465f4eb1bec7127d68136b08
2021-10-01T00:00:00Z
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8577786/?tool=EBI
https://doaj.org/toc/1553-734X
https://doaj.org/toc/1553-7358
Public Library of Science (PLoS)
article
Biology (General)
QH301-705.5
EN
PLoS Computational Biology, Vol 17, Iss 10 (2021)
institution DOAJ
collection DOAJ
language EN
topic Biology (General)
QH301-705.5
description MHC-I associated peptides (MAPs) play a central role in the elimination of virus-infected and neoplastic cells by CD8 T cells. However, accurately predicting the MAP repertoire remains difficult because only a fraction of the transcriptome generates MAPs. In this study, we investigated whether codon arrangement (usage and placement) regulates MAP biogenesis. We developed an artificial neural network called Codon Arrangement MAP Predictor (CAMAP), which predicts MAP presentation solely from the mRNA sequences flanking the MAP-coding codons (MCCs), excluding the MCCs themselves. CAMAP predictions were significantly more accurate when using original codon sequences than when using shuffled codon sequences, which reflect amino acid usage. Furthermore, predictions were independent of mRNA expression and of MAP binding affinity to MHC-I molecules, and applied to several cell types and species. Combining MAP ligand scores, transcript expression levels and CAMAP scores was particularly useful for increasing MAP prediction accuracy. Using an in vitro assay, we showed that varying the synonymous codons in the regions flanking the MCCs (without changing the amino acid sequence) significantly modulated MAP presentation at the cell surface. Taken together, our results demonstrate the role of codon arrangement in the regulation of MAP presentation and support the integration of both translational and post-translational events in predictive algorithms to improve modeling of the immunopeptidome.
Author summary: MHC-I associated peptides (MAPs) are small fragments of intracellular proteins presented at the surface of cells and used by the immune system to detect and eliminate cancerous or virus-infected cells. While it is theoretically possible to predict which portions of intracellular proteins will be naturally processed by cells and ultimately reach the surface, current methodologies have prohibitively high false discovery rates. Here we introduce an artificial neural network called Codon Arrangement MAP Predictor (CAMAP), which combines information from mRNA-to-protein translation with other factors regulating MAP biogenesis (e.g. MAP ligand score and transcript expression levels) to improve MAP prediction accuracy. While most MAP predictive approaches focus on the MAP sequences themselves, CAMAP's novelty is to analyze the MAP-flanking mRNA sequences, thereby providing completely independent information for MAP prediction. We show on several datasets that integrating CAMAP scores with other known factors involved in MAP presentation (i.e. MAP ligand score and mRNA expression) significantly improves MAP prediction accuracy, and we further validate the features learned by CAMAP using an in vitro assay. These findings may have major implications for the design of vaccines against cancers and viruses, and in times of pandemics could accelerate the identification of relevant MAPs of viral origin.
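The two sequence manipulations mentioned in the description (taking the codons that flank the MCCs while leaving out the MCCs, and producing a shuffled control that keeps the amino acid sequence) can be illustrated with a minimal Python sketch. This is not the authors' code: the function names, the codon-based indexing, the context size and the uniform choice among synonymous codons are all assumptions made for illustration, and the record does not state how CAMAP actually builds its inputs or its shuffled sequences.

```python
import random
from itertools import product

# Standard genetic code, bases ordered T, C, A, G (well-known table, not from the record).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TO_AA = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Group codons by the amino acid (or stop signal) they encode.
SYNONYMS = {}
for codon, aa in CODON_TO_AA.items():
    SYNONYMS.setdefault(aa, []).append(codon)


def split_codons(seq):
    """Split an in-frame DNA coding sequence into a list of codons."""
    if len(seq) % 3 != 0:
        raise ValueError("sequence length must be a multiple of 3")
    return [seq[i:i + 3] for i in range(0, len(seq), 3)]


def translate(seq):
    """Translate an in-frame DNA sequence into one-letter amino acids."""
    return "".join(CODON_TO_AA[codon] for codon in split_codons(seq))


def flanking_context(cds, mcc_start, mcc_len, n_codons):
    """Return (upstream, downstream) codons around the MCCs, excluding the MCCs.

    Hypothetical helper: mcc_start and mcc_len are in codon units, and
    n_codons is an illustrative context size, not a value from the source.
    """
    codons = split_codons(cds)
    upstream = codons[max(0, mcc_start - n_codons):mcc_start]
    mcc_end = mcc_start + mcc_len
    downstream = codons[mcc_end:mcc_end + n_codons]
    return upstream, downstream


def shuffle_synonymous(codons, rng):
    """Replace each codon by a randomly chosen synonymous codon.

    Assumption: uniform sampling among synonyms, so the encoded amino
    acids are preserved while the codon arrangement is scrambled.
    """
    return [rng.choice(SYNONYMS[CODON_TO_AA[codon]]) for codon in codons]


if __name__ == "__main__":
    cds = "ATGGCTGAACTGAAACGTTCTGGTCCGTAA"  # toy sequence, not real data
    up, down = flanking_context(cds, mcc_start=3, mcc_len=3, n_codons=2)
    shuffled = shuffle_synonymous(up + down, random.Random(0))
    # The shuffled context encodes the same amino acids as the original one.
    print(translate("".join(up + down)) == translate("".join(shuffled)))
```

Comparing a predictor trained on the original context with one trained on the shuffled context is what separates a codon-level signal from an amino-acid-level one, which is the contrast the description draws.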
format article
author Tariq Daouda
Maude Dumont-Lagacé
Albert Feghaly
Yahya Benslimane
Rébecca Panes
Mathieu Courcelles
Mohamed Benhammadi
Lea Harrington
Pierre Thibault
François Major
Yoshua Bengio
Étienne Gagnon
Sébastien Lemieux
Claude Perreault
author_sort Tariq Daouda
title CAMAP: Artificial neural networks unveil the role of codon arrangement in modulating MHC-I peptides presentation
publisher Public Library of Science (PLoS)
publishDate 2021
url https://doaj.org/article/cbd385c9465f4eb1bec7127d68136b08