Enhancing the selection of a model-based clustering with external qualitative variables

Jean-Patrick Baudry 1 Margarida Cardoso 2 Gilles Celeux 3 Maria-José Amorim 4 Ana Sousa Ferreira 5
2 BRU-UNIDE
IST - Instituto Superior Técnico - Technical University of Lisbon
4 ISEL
IST - Instituto Superior Técnico - Technical University of Lisbon
5 BRU-UNIDE & CEAUL
IST - Instituto Superior Técnico - Technical University of Lisbon
Abstract : In cluster analysis, it is often useful to interpret the obtained partition with respect to external qualitative variables (defining known partitions) derived from alternative information. An approach is proposed in the model-based clustering context to select a model and a number of clusters in order to get a partition which both provides a good fit with the data and is related to the external variables. This approach makes use of the integrated joint likelihood of the data, the partition derived from the mixture model and the known partitions. It is worth noticing that the external qualitative variables are only used to select a relevant mixture model. Each mixture model is fitted by the maximum likelihood methodology from the observed data. Numerical experiments illustrate the promising behaviour of the derived criterion.
Document type :
Reports
[Research Report] RR-8124, 2012, pp.14


https://hal.inria.fr/hal-00747387
Contributor : Gilles Celeux <>
Submitted on : Wednesday, October 31, 2012 - 11:17:40 AM
Last modification on : Tuesday, October 28, 2014 - 6:27:46 PM

File

RR-8124.pdf
fileSource_public_author

Identifiers

  • HAL Id : hal-00747387, version 1

Collections

Citation

Jean-Patrick Baudry, Margarida Cardoso, Gilles Celeux, Maria-José Amorim, Ana Sousa Ferreira. Enhancing the selection of a model-based clustering with external qualitative variables. [Research Report] RR-8124, 2012, pp.14. <hal-00747387>

Export

Share

Metrics

Consultation de
la notice

385

Téléchargement du document

225