Journal article, Computational Statistics, Year: 2013

Simultaneous Gaussian Model-Based Clustering for Samples of Multiple Origins

Abstract

Gaussian mixture model-based clustering is now a standard tool for estimating a hypothetical underlying partition of a single dataset. In this paper, we aim to cluster several different datasets at the same time, in a context where the underlying populations, even though different, are not completely unrelated: all individuals are described by the same features, and partitions of identical meaning are expected. Justifying, from some natural arguments, a stochastic linear link between the components of the mixtures associated with each dataset, we propose parsimonious and meaningful models for a so-called simultaneous clustering method. Maximum likelihood estimates of the mixture parameters, subject to the linear link constraint, can be easily computed by a Generalized Expectation Maximization (GEM) algorithm that we describe. Promising results are obtained in a biological context, where simultaneous clustering outperforms independent clustering for partitioning three different subspecies of birds. Further results on ornithological data show that the proposed strategy is robust to the relaxation of exact descriptor concordance, which is one of its main assumptions.
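
The abstract does not spell out the form of the linear link, so the following is only a minimal sketch under stated assumptions rather than the authors' model. It assumes the link for component k is an affine map x -> D_k x + b_k with a diagonal D_k (a hypothetical choice made here for illustration), and uses the standard fact that such a map sends a Gaussian N(mu, Sigma) to N(D mu + b, D Sigma D^T). The function name `link_component` and all numerical values are invented for the example.

```python
import numpy as np

def link_component(mean, cov, d, b):
    """Push a Gaussian component N(mean, cov) through x -> diag(d) @ x + b.

    Assumption (not from the abstract): the link matrix is diagonal.
    """
    D = np.diag(d)
    return D @ mean + b, D @ cov @ D.T

# Hypothetical reference mixture: 2 features, 2 components (illustrative values).
ref_means = [np.array([0.0, 0.0]), np.array([3.0, 1.0])]
ref_covs = [np.eye(2), np.array([[2.0, 0.5], [0.5, 1.0]])]

# Hypothetical link parameters, one pair (d_k, b_k) per component.
scales = [np.array([1.0, 2.0]), np.array([0.5, 1.0])]
shifts = [np.array([1.0, -1.0]), np.array([0.0, 2.0])]

# Component parameters of the second population implied by the link.
linked = [
    link_component(m, S, d, b)
    for m, S, d, b in zip(ref_means, ref_covs, scales, shifts)
]

for k, (m, S) in enumerate(linked):
    print(f"component {k}: mean={m}, cov=\n{S}")
```

Under this kind of constraint, the second population's component parameters are determined by the reference parameters and the link, so a constrained fit estimates fewer free quantities than fitting each dataset independently. The estimation step itself (the GEM algorithm mentioned in the abstract) is not reproduced here.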
Main file
classifsimul.pdf (189.97 KB)
Origin: publisher files allowed on an open archive

Dates and versions

hal-00921041, version 1 (19-12-2013)

Cite

Alexandre Lourme, Christophe Biernacki. Simultaneous Gaussian Model-Based Clustering for Samples of Multiple Origins. Computational Statistics, 2013, 28, pp. 371-391. ⟨10.1007/s00180-012-0305-5⟩. ⟨hal-00921041⟩