Conference papers

The Latent Block Model: a useful model for high dimensional data

Christine Keribin 1, 2 Gilles Celeux 1 Valérie Robert 1, 2
1 SELECT - Model selection in statistical learning
LMO - Laboratoire de Mathématiques d'Orsay, Inria Saclay - Ile de France
Abstract : The Latent Block Model (LBM) simultaneously clusters the rows and the columns of a data array. The LBM is typically expected to be useful for analyzing huge data sets with many observations and many variables. However, it runs into several numerical issues with big data sets: maximum likelihood estimation is jeopardized by spurious maxima, and selecting a proper model is challenging because many models are in competition. In this communication, we analyze these numerical issues. In particular, we use Bayesian inference to avoid spurious solutions and propose an efficient way to scan the model set. Moreover, we advocate the exact Integrated Completed Likelihood (ICL) criterion to select a proper and consistent LBM. The methods and algorithms will be illustrated with pharmacovigilance data involving large arrays of data.
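As a rough illustration of the model the abstract describes, the sketch below simulates a small binary array from a latent block structure and evaluates the completed (complete-data) log-likelihood that the ICL criterion builds on. It is only a sketch under stated assumptions: the abstract does not specify the emission distribution, so Bernoulli blocks are assumed here, and the function names `simulate_bernoulli_lbm` and `complete_log_likelihood`, as well as all parameter values, are hypothetical and not taken from the paper.

```python
import math
import random


def simulate_bernoulli_lbm(n_rows, n_cols, row_props, col_props, block_probs, seed=0):
    """Draw a binary array from a latent block structure (Bernoulli blocks assumed)."""
    rng = random.Random(seed)
    # Latent row labels z and column labels w, drawn independently
    # from the row and column mixing proportions.
    z = rng.choices(range(len(row_props)), weights=row_props, k=n_rows)
    w = rng.choices(range(len(col_props)), weights=col_props, k=n_cols)
    # Given the labels, cell (i, j) is Bernoulli with the parameter of block (z[i], w[j]).
    x = [
        [1 if rng.random() < block_probs[z[i]][w[j]] else 0 for j in range(n_cols)]
        for i in range(n_rows)
    ]
    return x, z, w


def complete_log_likelihood(x, z, w, row_props, col_props, block_probs):
    """Log-likelihood of the data together with the row and column labels.

    Assumes every block probability lies strictly between 0 and 1.
    """
    ll = sum(math.log(row_props[k]) for k in z)
    ll += sum(math.log(col_props[l]) for l in w)
    for i, row in enumerate(x):
        for j, x_ij in enumerate(row):
            p = block_probs[z[i]][w[j]]
            ll += math.log(p) if x_ij else math.log(1.0 - p)
    return ll


# Hypothetical example: 2 row clusters, 3 column clusters.
row_props = [0.5, 0.5]
col_props = [0.3, 0.3, 0.4]
block_probs = [[0.9, 0.1, 0.5], [0.2, 0.8, 0.4]]
x, z, w = simulate_bernoulli_lbm(20, 15, row_props, col_props, block_probs, seed=1)
ll = complete_log_likelihood(x, z, w, row_props, col_props, block_probs)
```

The completed log-likelihood is cheap to evaluate because the labels are known. The likelihood of the data alone requires summing over all joint row and column labelings, which is one reason inference is numerically hard on large arrays.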

https://hal.inria.fr/hal-01658589
Contributor : Christine Keribin
Submitted on : Thursday, December 7, 2017 - 5:11:07 PM
Last modification on : Friday, April 30, 2021 - 9:54:48 AM

File

KERIBIN-CELEUX-ROBERT-ISI17.pd...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01658589, version 1

Citation

Christine Keribin, Gilles Celeux, Valérie Robert. The Latent Block Model: a useful model for high dimensional data. ISI 2017 - 61st world statistics congress, Jul 2017, Marrakech, Morocco. pp.1-6. ⟨hal-01658589⟩
