Bayesian variable selection for probit mixed models

Abstract : In computational biology, gene expression datasets are characterized by very few individual samples compared to a large number of measurments per sample. Thus, it is appealing to merge these datasets in order to increase the number of observations and diversify the data, allowing a more reliable selection of genes relevant to the biological problem. This necessitates the introduction of the dataset as a random effect. Extending previous work of Lee et al. (2003), a method is proposed to select relevant variables among tens of thousands in a probit mixed regression model, considered as part of a larger hierarchical Bayesian model. Latent variables are used to identify subsets of selected variables and the collapsing technique of Liu (1994) is combined with a Metropolis-within-Gibbs algorithm (Robert and Casella, 2004). The method is applied to a merged dataset made of three individual gene expression datasets, in which tens of thousands of measurements are available for each of several hundred human breast cancer samples. Even for this large dataset comprised of around 20000 predictors, the method is shown to be efficient and feasible. As a demonstration, it is used to select the most important genes that characterize the estrogen receptor status of the cancer patients.
Type de document :
Communication dans un congrès
42èmes Journées de Statistique, 2010, Marseille, France, France. 2010
Liste complète des métadonnées

Littérature citée [2 références]  Voir  Masquer  Télécharger

https://hal.inria.fr/inria-00494781
Contributeur : Conférence Sfds-Hal <>
Soumis le : jeudi 24 juin 2010 - 08:58:30
Dernière modification le : jeudi 18 janvier 2018 - 01:25:31
Document(s) archivé(s) le : lundi 22 octobre 2012 - 14:46:50

Fichier

p151.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : inria-00494781, version 1

Collections

Citation

Meïli Baragatti. Bayesian variable selection for probit mixed models. 42èmes Journées de Statistique, 2010, Marseille, France, France. 2010. 〈inria-00494781〉

Partager

Métriques

Consultations de la notice

107

Téléchargements de fichiers

73