Skip to Main content Skip to Navigation
Documents associated with scientific events

Simultaneous dimension reduction and multi-objective clustering using probabilistic factorial discriminant analysis

Vincent Vandewalle 1, 2 
2 MODAL - MOdel for Data Analysis and Learning
LPP - Laboratoire Paul Painlevé - UMR 8524, Université de Lille, Sciences et Technologies, Inria Lille - Nord Europe, METRICS - Evaluation des technologies de santé et des pratiques médicales - ULR 2694, Polytech Lille - École polytechnique universitaire de Lille
Abstract : In model based clustering of quantitative data it is often supposed that only one clustering variable explains the heterogeneity of all the others variables. However, when variables come from different sources, it is often unrealistic to suppose that the heterogeneity of the data can only be explained by one variable. If such an assumption is made, this could lead to a high number of clusters which could be difficult to interpret. A model based multi-objective clustering is proposed, is assumes the existence of several latent clustering variables, each one explaining the heterogeneity of the data on some clustering projection. In order to estimate the parameters of the model an EM algorithm is proposed, it mainly relies on a reinterpretation of the standard factorial discriminant analysis in a probabilistic way. The obtained results are projections of the data on some principal clustering components allowing some synthetic interpretation of the principal clusters raised by the data. The behavior of the model is illustrated on simulated and real data.
Document type :
Documents associated with scientific events
Complete list of metadata

https://hal.inria.fr/hal-01424965
Contributor : Vincent Vandewalle Connect in order to contact the contributor
Submitted on : Tuesday, January 3, 2017 - 10:58:18 AM
Last modification on : Wednesday, March 23, 2022 - 3:51:09 PM
Long-term archiving on: : Tuesday, April 4, 2017 - 1:09:16 PM

File

slidesERCIM.pdf
Files produced by the author(s)

Identifiers

  • HAL Id : hal-01424965, version 1

Collections

Citation

Vincent Vandewalle. Simultaneous dimension reduction and multi-objective clustering using probabilistic factorial discriminant analysis. CMStatistics 2016, Dec 2016, Sevilla, Spain. ⟨hal-01424965⟩

Share

Metrics

Record views

205

Files downloads

69