inria-00179186, version 1
Parameter Setting for Evolutionary Latent Class Clustering
Damien Tessier
a, 1Marc Schoenauer
a, 1Christophe Biernacki b, 2Gilles Celeux
a, 3Gérard Govaert c, 4
Second International Symposium, ISICA 2007 4683 (2007) 472-484
Abstract: The latent class model or multivariate multinomial mixture is a powerful model for clustering discrete data. This model is expected to be useful to represent non-homogeneous populations. It uses a conditional independence assumption given the latent class to which a statistical unit is belonging. However, it leads to a criterion that proves difficult to optimise by the standard approach based on the EM algorithm. An Evolutionary Algorithms is designed to tackle this discrete optimisation problem, and an extensive parameter study on a large artificial dataset allows to derive stable parameters. Those parameters are then validated on other artificial datasets, as well as on some well-known real data: the Evolutionary Algorithm performs repeatedly better than other standard clustering techniques on the same data.
- a – INRIA
- b – Université des Sciences et Technologie de Lille - Lille I
- c – Université de Technologie de Compiègne
- 1: TAO (INRIA Futurs)
- INRIA – CNRS : UMR8623 – Université Paris XI - Paris Sud
- 2: Laboratoire Paul Painlevé (LPP)
- CNRS : UMR8524 – Université Lille 1 - Sciences et Technologies
- 3: SELECT (INRIA Futurs)
- INRIA – Université Paris XI - Paris Sud
- 4: UMR CNRS 6599 (UMR CNRS 6599)
- Université de Technologie de Compiègne
- Domain : Computer Science/Artificial Intelligence
Mathematics/Statistics
- inria-00179186, version 1
- http://hal.inria.fr/inria-00179186
- oai:hal.inria.fr:inria-00179186
- From: Marc Schoenauer
- Submitted on: Sunday, 14 October 2007 07:39:09
- Updated on: Monday, 15 October 2007 19:09:07






Associated documents
Export