Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data

ABSTRACT Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polyg...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Nenad Macesic, Oliver J. Bear Don’t Walk, Itsik Pe’er, Nicholas P. Tatonetti, Anton Y. Peleg, Anne-Catrin Uhlemann
Formato: article
Lenguaje:EN
Publicado: American Society for Microbiology 2020
Materias:
Acceso en línea:https://doaj.org/article/b206dea6959348dd85e0c18e57dc1f9e
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:b206dea6959348dd85e0c18e57dc1f9e
record_format dspace
spelling oai:doaj.org-article:b206dea6959348dd85e0c18e57dc1f9e2021-12-02T19:46:20ZPredicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data10.1128/mSystems.00656-192379-5077https://doaj.org/article/b206dea6959348dd85e0c18e57dc1f9e2020-06-01T00:00:00Zhttps://journals.asm.org/doi/10.1128/mSystems.00656-19https://doaj.org/toc/2379-5077ABSTRACT Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from >600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P = 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P = 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics.Nenad MacesicOliver J. Bear Don’t WalkItsik Pe’erNicholas P. TatonettiAnton Y. PelegAnne-Catrin UhlemannAmerican Society for Microbiologyarticlegenotypephenotypepredictionantimicrobial resistancemachine learningMicrobiologyQR1-502ENmSystems, Vol 5, Iss 3 (2020)
institution DOAJ
collection DOAJ
language EN
topic genotype
phenotype
prediction
antimicrobial resistance
machine learning
Microbiology
QR1-502
spellingShingle genotype
phenotype
prediction
antimicrobial resistance
machine learning
Microbiology
QR1-502
Nenad Macesic
Oliver J. Bear Don’t Walk
Itsik Pe’er
Nicholas P. Tatonetti
Anton Y. Peleg
Anne-Catrin Uhlemann
Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data
description ABSTRACT Polymyxins are used as treatments of last resort for Gram-negative bacterial infections. Their increased use has led to concerns about emerging polymyxin resistance (PR). Phenotypic polymyxin susceptibility testing is resource intensive and difficult to perform accurately. The complex polygenic nature of PR and our incomplete understanding of its genetic basis make it difficult to predict PR using detection of resistance determinants. We therefore applied machine learning (ML) to whole-genome sequencing data from >600 Klebsiella pneumoniae clonal group 258 (CG258) genomes to predict phenotypic PR. Using a reference-based representation of genomic data with ML outperformed a rule-based approach that detected variants in known PR genes (area under receiver-operator curve [AUROC], 0.894 versus 0.791, P = 0.006). We noted modest increases in performance by using a bacterial genome-wide association study to filter relevant genomic features and by integrating clinical data in the form of prior polymyxin exposure. Conversely, reference-free representation of genomic data as k-mers was associated with decreased performance (AUROC, 0.692 versus 0.894, P = 0.015). When ML models were interpreted to extract genomic features, six of seven known PR genes were correctly identified by models without prior programming and several genes involved in stress responses and maintenance of the cell membrane were identified as potential novel determinants of PR. These findings are a proof of concept that whole-genome sequencing data can accurately predict PR in K. pneumoniae CG258 and may be applicable to other forms of complex antimicrobial resistance. IMPORTANCE Polymyxins are last-resort antibiotics used to treat highly resistant Gram-negative bacteria. There are increasing reports of polymyxin resistance emerging, raising concerns of a postantibiotic era. Polymyxin resistance is therefore a significant public health threat, but current phenotypic methods for detection are difficult and time-consuming to perform. There have been increasing efforts to use whole-genome sequencing for detection of antibiotic resistance, but this has been difficult to apply to polymyxin resistance because of its complex polygenic nature. The significance of our research is that we successfully applied machine learning methods to predict polymyxin resistance in Klebsiella pneumoniae clonal group 258, a common health care-associated and multidrug-resistant pathogen. Our findings highlight that machine learning can be successfully applied even in complex forms of antibiotic resistance and represent a significant contribution to the literature that could be used to predict resistance in other bacteria and to other antibiotics.
format article
author Nenad Macesic
Oliver J. Bear Don’t Walk
Itsik Pe’er
Nicholas P. Tatonetti
Anton Y. Peleg
Anne-Catrin Uhlemann
author_facet Nenad Macesic
Oliver J. Bear Don’t Walk
Itsik Pe’er
Nicholas P. Tatonetti
Anton Y. Peleg
Anne-Catrin Uhlemann
author_sort Nenad Macesic
title Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data
title_short Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data
title_full Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data
title_fullStr Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data
title_full_unstemmed Predicting Phenotypic Polymyxin Resistance in <named-content content-type="genus-species">Klebsiella pneumoniae</named-content> through Machine Learning Analysis of Genomic Data
title_sort predicting phenotypic polymyxin resistance in <named-content content-type="genus-species">klebsiella pneumoniae</named-content> through machine learning analysis of genomic data
publisher American Society for Microbiology
publishDate 2020
url https://doaj.org/article/b206dea6959348dd85e0c18e57dc1f9e
work_keys_str_mv AT nenadmacesic predictingphenotypicpolymyxinresistanceinnamedcontentcontenttypegenusspeciesklebsiellapneumoniaenamedcontentthroughmachinelearninganalysisofgenomicdata
AT oliverjbeardontwalk predictingphenotypicpolymyxinresistanceinnamedcontentcontenttypegenusspeciesklebsiellapneumoniaenamedcontentthroughmachinelearninganalysisofgenomicdata
AT itsikpeer predictingphenotypicpolymyxinresistanceinnamedcontentcontenttypegenusspeciesklebsiellapneumoniaenamedcontentthroughmachinelearninganalysisofgenomicdata
AT nicholasptatonetti predictingphenotypicpolymyxinresistanceinnamedcontentcontenttypegenusspeciesklebsiellapneumoniaenamedcontentthroughmachinelearninganalysisofgenomicdata
AT antonypeleg predictingphenotypicpolymyxinresistanceinnamedcontentcontenttypegenusspeciesklebsiellapneumoniaenamedcontentthroughmachinelearninganalysisofgenomicdata
AT annecatrinuhlemann predictingphenotypicpolymyxinresistanceinnamedcontentcontenttypegenusspeciesklebsiellapneumoniaenamedcontentthroughmachinelearninganalysisofgenomicdata
_version_ 1718375984632168448