HAL will be down for maintenance from Friday, June 10 at 4pm through Monday, June 13 at 9am. More information
Skip to Main content Skip to Navigation

A statistical approach for CGH microarray data analysis

Franck Picard 1 Stéphane Robin Marc Lavielle 1 Christian Vaisse Gilles Celeux 1 Jean-Jacques Daudin
1 SELECT - Model selection in statistical learning
LMO - Laboratoire de Mathématiques d'Orsay, Inria Saclay - Ile de France
Abstract : Microarray-CGH experiments aim at detecting and mapping chromosomal imbalances, by hybridizing targets of genomic DNA from a test and a reference sample. Probes are constituted by sequences of genomic DNA (BACs) that are mapped on the genome. For this reason, the signal has a spatial coherence that has to be handled by specific statistical tools. Process segmentation seems to be a natural framework for this purpose. A CGH profile can be viewed as a succession of segments that represent homogeneous regions in the genome whose BACs share the same relative copy number in average. We model a CGH profile by a random gaussian process whose distribution parameters are affected by abrupt changes at unknown coordinates. Two major problem arise: the estimation of the break-points coordinates and the estimation of the number of segments. A dynamic programming algorithm is used to partition the data into a finite number of segments. A model selection approach is used to determine the number of segments in the profile, using an adaptative method. We explain why classical penalized criteria can not be used in the context of break-points detection and show the potentialities of our methodology, using publicly available data sets. We detect previously mapped chromosomal aberrations and disuss the performance of our methodology on noisier data concerning breast cancer cell lines.
Document type :
Complete list of metadata

Contributor : Rapport de Recherche Inria Connect in order to contact the contributor
Submitted on : Tuesday, May 23, 2006 - 5:33:40 PM
Last modification on : Wednesday, April 20, 2022 - 3:37:39 AM
Long-term archiving on: : Sunday, April 4, 2010 - 10:14:36 PM


  • HAL Id : inria-00071444, version 1


Franck Picard, Stéphane Robin, Marc Lavielle, Christian Vaisse, Gilles Celeux, et al.. A statistical approach for CGH microarray data analysis. [Research Report] RR-5139, INRIA. 2004. ⟨inria-00071444⟩



Record views


Files downloads