Functional clustering of periodic transcriptional profiles through ARMA(p,q).

<h4>Background</h4>Gene clustering of periodic transcriptional profiles provides an opportunity to shed light on a variety of biological processes, but this technique relies critically upon the robust modeling of longitudinal covariance structure over time.<h4>Methodology</h4>...

Description complète

Enregistré dans:
Détails bibliographiques
Auteurs principaux: Ning Li, Timothy McMurry, Arthur Berg, Zhong Wang, Scott A Berceli, Rongling Wu
Format: article
Langue:EN
Publié: Public Library of Science (PLoS) 2010
Sujets:
R
Q
Accès en ligne:https://doaj.org/article/a0b5f56ebfc544768d9ac37b92d162e8
Tags: Ajouter un tag
Pas de tags, Soyez le premier à ajouter un tag!
Description
Résumé:<h4>Background</h4>Gene clustering of periodic transcriptional profiles provides an opportunity to shed light on a variety of biological processes, but this technique relies critically upon the robust modeling of longitudinal covariance structure over time.<h4>Methodology</h4>We propose a statistical method for functional clustering of periodic gene expression by modeling the covariance matrix of serial measurements through a general autoregressive moving-average process of order (p,q), the so-called ARMA(p,q). We derive a sophisticated EM algorithm to estimate the proportions of each gene cluster, the Fourier series parameters that define gene-specific differences in periodic expression trajectories, and the ARMA parameters that model the covariance structure within a mixture model framework. The orders and of the ARMA process that provide the best fit are identified by model selection criteria.<h4>Conclusions</h4>Through simulated data we show that whenever it is necessary, employment of sophisticated covariance structures such as ARMA is crucial in order to obtain unbiased estimates of the mean structure parameters and increased precision of estimation. The methods were implemented on recently published time-course gene expression data in yeast and the procedure was shown to effectively identify interesting periodic clusters in the dataset. The new approach will provide a powerful tool for understanding biological functions on a genomic scale.