Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences

Abstract The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health reso...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Juliana Carneiro Gomes, Aras Ismael Masood, Leandro Honorato de S. Silva, Janderson Romário B. da Cruz Ferreira, Agostinho Antônio Freire Júnior, Allana Laís dos Santos Rocha, Letícia Castro Portela de Oliveira, Nathália Regina Cauás da Silva, Bruno José Torres Fernandes, Wellington Pinheiro dos Santos
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/1d8c0342c1214426a93cc59753429af7
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:1d8c0342c1214426a93cc59753429af7
record_format dspace
spelling oai:doaj.org-article:1d8c0342c1214426a93cc59753429af72021-12-02T17:51:21ZCovid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences10.1038/s41598-021-90766-72045-2322https://doaj.org/article/1d8c0342c1214426a93cc59753429af72021-06-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-90766-7https://doaj.org/toc/2045-2322Abstract The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health resources in most countries. RT-PCR is Covid-19’s reference diagnostic method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences with overlap in a pseudo-convolutional approach and represented by co-occurrence matrices. This technique eliminates multiple sequence alignment. Through the proposed method, it is possible to identify virus sequences from a large database: 347,363 virus DNA sequences from 24 virus families and SARS-CoV-2. When comparing SARS-CoV-2 with virus families with similar symptoms, we obtained $$0.97 \pm 0.03$$ 0.97 ± 0.03 for sensitivity and $$0.9919 \pm 0.0005$$ 0.9919 ± 0.0005 for specificity with MLP classifier and 30% overlap. When SARS-CoV-2 is compared to other coronaviruses and healthy human DNA sequences, we obtained $$0.99 \pm 0.01$$ 0.99 ± 0.01 for sensitivity and $$0.9986 \pm 0.0002$$ 0.9986 ± 0.0002 for specificity with MLP and 50% overlap. Therefore, the molecular diagnosis of Covid-19 can be optimized by combining RT-PCR and our pseudo-convolutional method to identify DNA sequences for SARS-CoV-2 with greater specificity and sensitivity.Juliana Carneiro GomesAras Ismael MasoodLeandro Honorato de S. SilvaJanderson Romário B. da Cruz FerreiraAgostinho Antônio Freire JúniorAllana Laís dos Santos RochaLetícia Castro Portela de OliveiraNathália Regina Cauás da SilvaBruno José Torres FernandesWellington Pinheiro dos SantosNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-28 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Juliana Carneiro Gomes
Aras Ismael Masood
Leandro Honorato de S. Silva
Janderson Romário B. da Cruz Ferreira
Agostinho Antônio Freire Júnior
Allana Laís dos Santos Rocha
Letícia Castro Portela de Oliveira
Nathália Regina Cauás da Silva
Bruno José Torres Fernandes
Wellington Pinheiro dos Santos
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
description Abstract The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health resources in most countries. RT-PCR is Covid-19’s reference diagnostic method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences with overlap in a pseudo-convolutional approach and represented by co-occurrence matrices. This technique eliminates multiple sequence alignment. Through the proposed method, it is possible to identify virus sequences from a large database: 347,363 virus DNA sequences from 24 virus families and SARS-CoV-2. When comparing SARS-CoV-2 with virus families with similar symptoms, we obtained $$0.97 \pm 0.03$$ 0.97 ± 0.03 for sensitivity and $$0.9919 \pm 0.0005$$ 0.9919 ± 0.0005 for specificity with MLP classifier and 30% overlap. When SARS-CoV-2 is compared to other coronaviruses and healthy human DNA sequences, we obtained $$0.99 \pm 0.01$$ 0.99 ± 0.01 for sensitivity and $$0.9986 \pm 0.0002$$ 0.9986 ± 0.0002 for specificity with MLP and 50% overlap. Therefore, the molecular diagnosis of Covid-19 can be optimized by combining RT-PCR and our pseudo-convolutional method to identify DNA sequences for SARS-CoV-2 with greater specificity and sensitivity.
format article
author Juliana Carneiro Gomes
Aras Ismael Masood
Leandro Honorato de S. Silva
Janderson Romário B. da Cruz Ferreira
Agostinho Antônio Freire Júnior
Allana Laís dos Santos Rocha
Letícia Castro Portela de Oliveira
Nathália Regina Cauás da Silva
Bruno José Torres Fernandes
Wellington Pinheiro dos Santos
author_facet Juliana Carneiro Gomes
Aras Ismael Masood
Leandro Honorato de S. Silva
Janderson Romário B. da Cruz Ferreira
Agostinho Antônio Freire Júnior
Allana Laís dos Santos Rocha
Letícia Castro Portela de Oliveira
Nathália Regina Cauás da Silva
Bruno José Torres Fernandes
Wellington Pinheiro dos Santos
author_sort Juliana Carneiro Gomes
title Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
title_short Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
title_full Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
title_fullStr Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
title_full_unstemmed Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
title_sort covid-19 diagnosis by combining rt-pcr and pseudo-convolutional machines to characterize virus sequences
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/1d8c0342c1214426a93cc59753429af7
work_keys_str_mv AT julianacarneirogomes covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT arasismaelmasood covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT leandrohonoratodessilva covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT jandersonromariobdacruzferreira covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT agostinhoantoniofreirejunior covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT allanalaisdossantosrocha covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT leticiacastroporteladeoliveira covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT nathaliareginacauasdasilva covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT brunojosetorresfernandes covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
AT wellingtonpinheirodossantos covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences
_version_ 1718379272276541440