Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.

The cannabis community typically uses the terms "Sativa" and "Indica" to characterize drug strains with high tetrahydrocannabinol (THC) levels. Due to large scale, extensive, and unrecorded hybridization in the past 40 years, this vernacular naming convention has become unreliabl...

Descripción completa

Guardado en:

Detalles Bibliográficos
Autores principales:	Dan Jin, Philippe Henry, Jacqueline Shan, Jie Chen
Formato:	article
Lenguaje:	EN
Publicado:	Public Library of Science (PLoS) 2021
Materias:	Medicine R Science Q
Acceso en línea:	https://doaj.org/article/9ea08975c32a4785af5ce1c6b56b634e
Etiquetas:	Agregar Etiqueta Sin Etiquetas, Sea el primero en etiquetar este registro!

id	oai:doaj.org-article:9ea08975c32a4785af5ce1c6b56b634e
record_format	dspace
spelling	oai:doaj.org-article:9ea08975c32a4785af5ce1c6b56b634e2021-12-02T20:09:54ZClassification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.1932-620310.1371/journal.pone.0253387https://doaj.org/article/9ea08975c32a4785af5ce1c6b56b634e2021-01-01T00:00:00Zhttps://doi.org/10.1371/journal.pone.0253387https://doaj.org/toc/1932-6203The cannabis community typically uses the terms "Sativa" and "Indica" to characterize drug strains with high tetrahydrocannabinol (THC) levels. Due to large scale, extensive, and unrecorded hybridization in the past 40 years, this vernacular naming convention has become unreliable and inadequate for identifying or selecting strains for clinical research and medicinal production. Additionally, cannabidiol (CBD) dominant strains and balanced strains (or intermediate strains, which have intermediate levels of THC and CBD), are not included in the current classification studies despite the increasing research interest in the therapeutic potential of CBD. This paper is the first in a series of studies proposing that a new classification system be established based on genome-wide variation and supplemented by data on secondary metabolites and morphological characteristics. This study performed a whole-genome sequencing of 23 cannabis strains marketed in Canada, aligned sequences to a reference genome, and, after filtering for minor allele frequency of 10%, identified 137,858 single nucleotide polymorphisms (SNPs). Discriminant analysis of principal components (DAPC) was applied to these SNPs and further identified 344 structural SNPs, which classified individual strains into five chemotype-aligned groups: one CBD dominant, one balanced, and three THC dominant clusters. These structural SNPs were all multiallelic and were predominantly tri-allelic (339/344). The largest portion of these SNPs (37%) occurred on the same chromosome containing genes for CBD acid synthases (CBDAS) and THC acid synthases (THCAS). The remainder (63%) were located on the other nine chromosomes. These results showed that the genetic differences between modern cannabis strains were at a whole-genome level and not limited to THC or CBD production. These SNPs contained enough genetic variation for classifying individual strains into corresponding chemotypes. In an effort to elucidate the confused genetic backgrounds of commercially available cannabis strains, this classification attempt investigated the utility of DAPC for classifying modern cannabis strains and for identifying structural SNPs.Dan JinPhilippe HenryJacqueline ShanJie ChenPublic Library of Science (PLoS)articleMedicineRScienceQENPLoS ONE, Vol 16, Iss 6, p e0253387 (2021)
institution	DOAJ
collection	DOAJ
language	EN
topic	Medicine R Science Q
spellingShingle	Medicine R Science Q Dan Jin Philippe Henry Jacqueline Shan Jie Chen Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
description	The cannabis community typically uses the terms "Sativa" and "Indica" to characterize drug strains with high tetrahydrocannabinol (THC) levels. Due to large scale, extensive, and unrecorded hybridization in the past 40 years, this vernacular naming convention has become unreliable and inadequate for identifying or selecting strains for clinical research and medicinal production. Additionally, cannabidiol (CBD) dominant strains and balanced strains (or intermediate strains, which have intermediate levels of THC and CBD), are not included in the current classification studies despite the increasing research interest in the therapeutic potential of CBD. This paper is the first in a series of studies proposing that a new classification system be established based on genome-wide variation and supplemented by data on secondary metabolites and morphological characteristics. This study performed a whole-genome sequencing of 23 cannabis strains marketed in Canada, aligned sequences to a reference genome, and, after filtering for minor allele frequency of 10%, identified 137,858 single nucleotide polymorphisms (SNPs). Discriminant analysis of principal components (DAPC) was applied to these SNPs and further identified 344 structural SNPs, which classified individual strains into five chemotype-aligned groups: one CBD dominant, one balanced, and three THC dominant clusters. These structural SNPs were all multiallelic and were predominantly tri-allelic (339/344). The largest portion of these SNPs (37%) occurred on the same chromosome containing genes for CBD acid synthases (CBDAS) and THC acid synthases (THCAS). The remainder (63%) were located on the other nine chromosomes. These results showed that the genetic differences between modern cannabis strains were at a whole-genome level and not limited to THC or CBD production. These SNPs contained enough genetic variation for classifying individual strains into corresponding chemotypes. In an effort to elucidate the confused genetic backgrounds of commercially available cannabis strains, this classification attempt investigated the utility of DAPC for classifying modern cannabis strains and for identifying structural SNPs.
format	article
author	Dan Jin Philippe Henry Jacqueline Shan Jie Chen
author_facet	Dan Jin Philippe Henry Jacqueline Shan Jie Chen
author_sort	Dan Jin
title	Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
title_short	Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
title_full	Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
title_fullStr	Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
title_full_unstemmed	Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
title_sort	classification of cannabis strains in the canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.
publisher	Public Library of Science (PLoS)
publishDate	2021
url	https://doaj.org/article/9ea08975c32a4785af5ce1c6b56b634e
work_keys_str_mv	AT danjin classificationofcannabisstrainsinthecanadianmarketwithdiscriminantanalysisofprincipalcomponentsusinggenomewidesinglenucleotidepolymorphisms AT philippehenry classificationofcannabisstrainsinthecanadianmarketwithdiscriminantanalysisofprincipalcomponentsusinggenomewidesinglenucleotidepolymorphisms AT jacquelineshan classificationofcannabisstrainsinthecanadianmarketwithdiscriminantanalysisofprincipalcomponentsusinggenomewidesinglenucleotidepolymorphisms AT jiechen classificationofcannabisstrainsinthecanadianmarketwithdiscriminantanalysisofprincipalcomponentsusinggenomewidesinglenucleotidepolymorphisms
_version_	1718375081247244288

Classification of cannabis strains in the Canadian market with discriminant analysis of principal components using genome-wide single nucleotide polymorphisms.

Ejemplares similares