A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.

The genetic diversity of plants has traditionally been employed to improve crop plants to suit human needs, and in the future feed the increasing population and protect crops from environmental stresses and climate change. Genome-wide sequencing is a reality and can be used to make association to cr...

Descripción completa

Guardado en:
Detalles Bibliográficos
Autores principales: Subodh K Srivastava, Pawel Wolinski, Andy Pereira
Formato: article
Lenguaje:EN
Publicado: Public Library of Science (PLoS) 2014
Materias:
R
Q
Acceso en línea:https://doaj.org/article/ae6c667ac93c41cdac0b784017475d04
Etiquetas: Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
id oai:doaj.org-article:ae6c667ac93c41cdac0b784017475d04
record_format dspace
spelling oai:doaj.org-article:ae6c667ac93c41cdac0b784017475d042021-11-25T06:00:05ZA strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.1932-620310.1371/journal.pone.0105335https://doaj.org/article/ae6c667ac93c41cdac0b784017475d042014-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0105335https://doaj.org/toc/1932-6203The genetic diversity of plants has traditionally been employed to improve crop plants to suit human needs, and in the future feed the increasing population and protect crops from environmental stresses and climate change. Genome-wide sequencing is a reality and can be used to make association to crop traits to be utilized by high-throughput marker based selection methods. This study describes a strategy of using next generation sequencing (NGS) data from the rice genome to make comparisons to the high-quality reference genome, identify functional polymorphisms within genes that might result in function changes and be used to study correlations to traits and employed in genetic mapping. We analyzed the NGS data of Oryza sativa ssp indica cv. G4 covering 241 Mb with ∼20X coverage and compared to the reference genome of Oryza sativa ssp. japonica to describe the genome-wide distribution of gene-based single nucleotide polymorphisms (SNPs). The analysis shows that the 63% covered genome consists of 1.6 million SNPs with 6.9 SNPs/Kb, and including 80,146 insertions and 92,655 deletions (INDELs) genome-wide. There are a total of 1,139,801 intergenic SNPs, 295,136 SNPs in intronic/non-coding regions, 195,098 in coding regions, 23,242 SNPs at the five-prime (5') UTR regions and 22,686 SNPs at the three-prime (3') UTR region. SNP variation was found in 40,761 gene loci, which include 75,262 synonymous and 119,836 non-synonymous changes, and functional reading frame changes through 3,886 inducing STOP-codon (isSNP) and 729 preventing STOP-codon (psSNP) variation. There are quickly evolving 194 high SNP hotspot genes (>100 SNPs/gene), and 1,513 out of 2,458 transcription factors displaying 2,294 non-synonymous SNPs that can be a major source of phenotypic diversity within the species. All data is searchable at https://plantstress-pereira.uark.edu/oryza2. We envision that this strategy will be useful for the identification of genes for crop traits and molecular breeding of rice cultivars.Subodh K SrivastavaPawel WolinskiAndy PereiraPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 9, Iss 9, p e105335 (2014)
institution DOAJ
collection DOAJ
language EN
topic Medicine
R
Science
Q
spellingShingle Medicine
R
Science
Q
Subodh K Srivastava
Pawel Wolinski
Andy Pereira
A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
description The genetic diversity of plants has traditionally been employed to improve crop plants to suit human needs, and in the future feed the increasing population and protect crops from environmental stresses and climate change. Genome-wide sequencing is a reality and can be used to make association to crop traits to be utilized by high-throughput marker based selection methods. This study describes a strategy of using next generation sequencing (NGS) data from the rice genome to make comparisons to the high-quality reference genome, identify functional polymorphisms within genes that might result in function changes and be used to study correlations to traits and employed in genetic mapping. We analyzed the NGS data of Oryza sativa ssp indica cv. G4 covering 241 Mb with ∼20X coverage and compared to the reference genome of Oryza sativa ssp. japonica to describe the genome-wide distribution of gene-based single nucleotide polymorphisms (SNPs). The analysis shows that the 63% covered genome consists of 1.6 million SNPs with 6.9 SNPs/Kb, and including 80,146 insertions and 92,655 deletions (INDELs) genome-wide. There are a total of 1,139,801 intergenic SNPs, 295,136 SNPs in intronic/non-coding regions, 195,098 in coding regions, 23,242 SNPs at the five-prime (5') UTR regions and 22,686 SNPs at the three-prime (3') UTR region. SNP variation was found in 40,761 gene loci, which include 75,262 synonymous and 119,836 non-synonymous changes, and functional reading frame changes through 3,886 inducing STOP-codon (isSNP) and 729 preventing STOP-codon (psSNP) variation. There are quickly evolving 194 high SNP hotspot genes (>100 SNPs/gene), and 1,513 out of 2,458 transcription factors displaying 2,294 non-synonymous SNPs that can be a major source of phenotypic diversity within the species. All data is searchable at https://plantstress-pereira.uark.edu/oryza2. We envision that this strategy will be useful for the identification of genes for crop traits and molecular breeding of rice cultivars.
format article
author Subodh K Srivastava
Pawel Wolinski
Andy Pereira
author_facet Subodh K Srivastava
Pawel Wolinski
Andy Pereira
author_sort Subodh K Srivastava
title A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
title_short A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
title_full A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
title_fullStr A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
title_full_unstemmed A strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
title_sort strategy for genome-wide identification of gene based polymorphisms in rice reveals non-synonymous variation and functional genotypic markers.
publisher Public Library of Science (PLoS)
publishDate 2014
url https://doaj.org/article/ae6c667ac93c41cdac0b784017475d04
work_keys_str_mv AT subodhksrivastava astrategyforgenomewideidentificationofgenebasedpolymorphismsinricerevealsnonsynonymousvariationandfunctionalgenotypicmarkers
AT pawelwolinski astrategyforgenomewideidentificationofgenebasedpolymorphismsinricerevealsnonsynonymousvariationandfunctionalgenotypicmarkers
AT andypereira astrategyforgenomewideidentificationofgenebasedpolymorphismsinricerevealsnonsynonymousvariationandfunctionalgenotypicmarkers
AT subodhksrivastava strategyforgenomewideidentificationofgenebasedpolymorphismsinricerevealsnonsynonymousvariationandfunctionalgenotypicmarkers
AT pawelwolinski strategyforgenomewideidentificationofgenebasedpolymorphismsinricerevealsnonsynonymousvariationandfunctionalgenotypicmarkers
AT andypereira strategyforgenomewideidentificationofgenebasedpolymorphismsinricerevealsnonsynonymousvariationandfunctionalgenotypicmarkers
_version_ 1718414300853305344