Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes

Abstract The increasing number of available genomic data allowed the development of phylogenomic analytical tools. Current methods compile information from single gene phylogenies, whether based on topologies or multiple sequence alignments. Generally, phylogenomic analyses elect gene families or ge...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Sandra Regina Maruyama, Luana Aparecida Rogerio, Patricia Domingues Freitas, Marta Maria Geraldes Teixeira, José Marcos Chaves Ribeiro
Formato: article
Lenguaje:EN
Publicado: Nature Portfolio 2021
Materias:
R
Q
Acceso en línea:https://doaj.org/article/621c8bd43681428cae759433ffcfeb0f
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:621c8bd43681428cae759433ffcfeb0f
record_format dspace
spelling oai:doaj.org-article:621c8bd43681428cae759433ffcfeb0f2021-12-02T14:21:53ZTotal Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes10.1038/s41598-021-81926-w2045-2322https://doaj.org/article/621c8bd43681428cae759433ffcfeb0f2021-02-01T00:00:00Zhttps://doi.org/10.1038/s41598-021-81926-whttps://doaj.org/toc/2045-2322Abstract The increasing number of available genomic data allowed the development of phylogenomic analytical tools. Current methods compile information from single gene phylogenies, whether based on topologies or multiple sequence alignments. Generally, phylogenomic analyses elect gene families or genomic regions to construct phylogenomic trees. Here, we presented an alternative approach for Phylogenomics, named TOMM (Total Ortholog Median Matrix), to construct a representative phylogram composed by amino acid distance measures of all pairwise ortholog protein sequence pairs from desired species inside a group of organisms. The procedure is divided two main steps, (1) ortholog detection and (2) creation of a matrix with the median amino acid distance measures of all pairwise orthologous sequences. We tested this approach within three different group of organisms: Kinetoplastida protozoa, hematophagous Diptera vectors and Primates. Our approach was robust and efficacious to reconstruct the phylogenetic relationships for the three groups. Moreover, novel branch topologies could be achieved, providing insights about some phylogenetic relationships between some taxa.Sandra Regina MaruyamaLuana Aparecida RogerioPatricia Domingues FreitasMarta Maria Geraldes TeixeiraJosé Marcos Chaves RibeiroNature PortfolioarticleMedicineRScienceQENScientific Reports, Vol 11, Iss 1, Pp 1-16 (2021)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Sandra Regina Maruyama
Luana Aparecida Rogerio
Patricia Domingues Freitas
Marta Maria Geraldes Teixeira
José Marcos Chaves Ribeiro
Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
description Abstract The increasing number of available genomic data allowed the development of phylogenomic analytical tools. Current methods compile information from single gene phylogenies, whether based on topologies or multiple sequence alignments. Generally, phylogenomic analyses elect gene families or genomic regions to construct phylogenomic trees. Here, we presented an alternative approach for Phylogenomics, named TOMM (Total Ortholog Median Matrix), to construct a representative phylogram composed by amino acid distance measures of all pairwise ortholog protein sequence pairs from desired species inside a group of organisms. The procedure is divided two main steps, (1) ortholog detection and (2) creation of a matrix with the median amino acid distance measures of all pairwise orthologous sequences. We tested this approach within three different group of organisms: Kinetoplastida protozoa, hematophagous Diptera vectors and Primates. Our approach was robust and efficacious to reconstruct the phylogenetic relationships for the three groups. Moreover, novel branch topologies could be achieved, providing insights about some phylogenetic relationships between some taxa.
format article
author Sandra Regina Maruyama
Luana Aparecida Rogerio
Patricia Domingues Freitas
Marta Maria Geraldes Teixeira
José Marcos Chaves Ribeiro
author_facet Sandra Regina Maruyama
Luana Aparecida Rogerio
Patricia Domingues Freitas
Marta Maria Geraldes Teixeira
José Marcos Chaves Ribeiro
author_sort Sandra Regina Maruyama
title Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
title_short Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
title_full Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
title_fullStr Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
title_full_unstemmed Total Ortholog Median Matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
title_sort total ortholog median matrix as an alternative unsupervised approach for phylogenomics based on evolutionary distance between protein coding genes
publisher Nature Portfolio
publishDate 2021
url https://doaj.org/article/621c8bd43681428cae759433ffcfeb0f
work_keys_str_mv AT sandrareginamaruyama totalorthologmedianmatrixasanalternativeunsupervisedapproachforphylogenomicsbasedonevolutionarydistancebetweenproteincodinggenes
AT luanaaparecidarogerio totalorthologmedianmatrixasanalternativeunsupervisedapproachforphylogenomicsbasedonevolutionarydistancebetweenproteincodinggenes
AT patriciadominguesfreitas totalorthologmedianmatrixasanalternativeunsupervisedapproachforphylogenomicsbasedonevolutionarydistancebetweenproteincodinggenes
AT martamariageraldesteixeira totalorthologmedianmatrixasanalternativeunsupervisedapproachforphylogenomicsbasedonevolutionarydistancebetweenproteincodinggenes
AT josemarcoschavesribeiro totalorthologmedianmatrixasanalternativeunsupervisedapproachforphylogenomicsbasedonevolutionarydistancebetweenproteincodinggenes
_version_ 1718391490171895808