Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences
Abstract The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health reso...
Guardado en:
Autores principales: | , , , , , , , , , |
---|---|
Formato: | article |
Lenguaje: | EN |
Publicado: |
Nature Portfolio
2021
|
Materias: | |
Acceso en línea: | https://doaj.org/article/1d8c0342c1214426a93cc59753429af7 |
Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
id |
oai:doaj.org-article:1d8c0342c1214426a93cc59753429af7 |
---|---|
record_format |
dspace |
spelling |
oai:doaj.org-article:1d8c0342c1214426a93cc59753429af72021-12-02T17:51:21ZCovid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences10.1038/s41598-021-90766-72045-2322https://doaj.org/article/1d8c0342c1214426a93cc59753429af72021-06-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-90766-7https://doaj.org/toc/2045-2322Abstract The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health resources in most countries. RT-PCR is Covid-19’s reference diagnostic method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences with overlap in a pseudo-convolutional approach and represented by co-occurrence matrices. This technique eliminates multiple sequence alignment. Through the proposed method, it is possible to identify virus sequences from a large database: 347,363 virus DNA sequences from 24 virus families and SARS-CoV-2. When comparing SARS-CoV-2 with virus families with similar symptoms, we obtained $$0.97 \pm 0.03$$ 0.97 ± 0.03 for sensitivity and $$0.9919 \pm 0.0005$$ 0.9919 ± 0.0005 for specificity with MLP classifier and 30% overlap. When SARS-CoV-2 is compared to other coronaviruses and healthy human DNA sequences, we obtained $$0.99 \pm 0.01$$ 0.99 ± 0.01 for sensitivity and $$0.9986 \pm 0.0002$$ 0.9986 ± 0.0002 for specificity with MLP and 50% overlap. Therefore, the molecular diagnosis of Covid-19 can be optimized by combining RT-PCR and our pseudo-convolutional method to identify DNA sequences for SARS-CoV-2 with greater specificity and sensitivity.Juliana Carneiro GomesAras Ismael MasoodLeandro Honorato de S. SilvaJanderson Romário B. da Cruz FerreiraAgostinho Antônio Freire JúniorAllana Laís dos Santos RochaLetícia Castro Portela de OliveiraNathália Regina Cauás da SilvaBruno José Torres FernandesWellington Pinheiro dos SantosNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-28 (2021) |
institution |
DOAJ |
collection |
DOAJ |
language |
EN |
topic |
Medicine R Science Q |
spellingShingle |
Medicine R Science Q Juliana Carneiro Gomes Aras Ismael Masood Leandro Honorato de S. Silva Janderson Romário B. da Cruz Ferreira Agostinho Antônio Freire Júnior Allana Laís dos Santos Rocha Letícia Castro Portela de Oliveira Nathália Regina Cauás da Silva Bruno José Torres Fernandes Wellington Pinheiro dos Santos Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences |
description |
Abstract The Covid-19 pandemic, a disease transmitted by the SARS-CoV-2 virus, has already caused the infection of more than 120 million people, of which 70 million have been recovered, while 3 million people have died. The high speed of infection has led to the rapid depletion of public health resources in most countries. RT-PCR is Covid-19’s reference diagnostic method. In this work we propose a new technique for representing DNA sequences: they are divided into smaller sequences with overlap in a pseudo-convolutional approach and represented by co-occurrence matrices. This technique eliminates multiple sequence alignment. Through the proposed method, it is possible to identify virus sequences from a large database: 347,363 virus DNA sequences from 24 virus families and SARS-CoV-2. When comparing SARS-CoV-2 with virus families with similar symptoms, we obtained $$0.97 \pm 0.03$$ 0.97 ± 0.03 for sensitivity and $$0.9919 \pm 0.0005$$ 0.9919 ± 0.0005 for specificity with MLP classifier and 30% overlap. When SARS-CoV-2 is compared to other coronaviruses and healthy human DNA sequences, we obtained $$0.99 \pm 0.01$$ 0.99 ± 0.01 for sensitivity and $$0.9986 \pm 0.0002$$ 0.9986 ± 0.0002 for specificity with MLP and 50% overlap. Therefore, the molecular diagnosis of Covid-19 can be optimized by combining RT-PCR and our pseudo-convolutional method to identify DNA sequences for SARS-CoV-2 with greater specificity and sensitivity. |
format |
article |
author |
Juliana Carneiro Gomes Aras Ismael Masood Leandro Honorato de S. Silva Janderson Romário B. da Cruz Ferreira Agostinho Antônio Freire Júnior Allana Laís dos Santos Rocha Letícia Castro Portela de Oliveira Nathália Regina Cauás da Silva Bruno José Torres Fernandes Wellington Pinheiro dos Santos |
author_facet |
Juliana Carneiro Gomes Aras Ismael Masood Leandro Honorato de S. Silva Janderson Romário B. da Cruz Ferreira Agostinho Antônio Freire Júnior Allana Laís dos Santos Rocha Letícia Castro Portela de Oliveira Nathália Regina Cauás da Silva Bruno José Torres Fernandes Wellington Pinheiro dos Santos |
author_sort |
Juliana Carneiro Gomes |
title |
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences |
title_short |
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences |
title_full |
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences |
title_fullStr |
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences |
title_full_unstemmed |
Covid-19 diagnosis by combining RT-PCR and pseudo-convolutional machines to characterize virus sequences |
title_sort |
covid-19 diagnosis by combining rt-pcr and pseudo-convolutional machines to characterize virus sequences |
publisher |
Nature Portfolio |
publishDate |
2021 |
url |
https://doaj.org/article/1d8c0342c1214426a93cc59753429af7 |
work_keys_str_mv |
AT julianacarneirogomes covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT arasismaelmasood covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT leandrohonoratodessilva covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT jandersonromariobdacruzferreira covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT agostinhoantoniofreirejunior covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT allanalaisdossantosrocha covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT leticiacastroporteladeoliveira covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT nathaliareginacauasdasilva covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT brunojosetorresfernandes covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences AT wellingtonpinheirodossantos covid19diagnosisbycombiningrtpcrandpseudoconvolutionalmachinestocharacterizevirussequences |
_version_ |
1718379272276541440 |