Skip to Main content Skip to Navigation
Reports

Clustering high-throughput sequencing data with Poisson mixture models

Abstract : In recent years gene expression studies have increasingly made use of next generation sequencing technology. In turn, research concerning the appropriate statistical methods for the analysis of digital gene expression has flourished, primarily in the context of normalization and differential analysis. In this work, we focus on the question of clustering digital gene expression profiles as a means to discover groups of co-expressed genes. We propose two parameterizations of a Poisson mixture model to cluster expression profiles of high-throughput sequencing data. A set of simulation studies compares the performance of the proposed models with that of an approach developed for a similar type of data, namely serial analysis of gene expression. We also study the performance of these approaches on two real high-throughput sequencing data sets. The R package HTSCluster used to implement the proposed Poisson mixture models is available on CRAN.
Document type :
Reports
Complete list of metadata

Cited literature [32 references]  Display  Hide  Download

https://hal.inria.fr/hal-01193758
Contributor : Andrea Rau Connect in order to contact the contributor
Submitted on : Thursday, November 3, 2011 - 6:26:48 PM
Last modification on : Sunday, June 26, 2022 - 2:14:03 AM
Long-term archiving on: : Saturday, February 4, 2012 - 2:30:40 AM

Files

RR-7786.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01193758, version 2
  • PRODINRA : 189252

Citation

Andrea Rau, Gilles Celeux, Marie-Laure Martin-Magniette, Cathy Maugis-Rabusseau. Clustering high-throughput sequencing data with Poisson mixture models. [Research Report] RR-7786, INRIA. 2011, pp.36. ⟨hal-01193758v2⟩

Share

Metrics

Record views

1022

Files downloads

1478