Journal article: Advances in Data Analysis and Classification, 2018

Variable selection in model-based clustering and discriminant analysis with a regularization approach

Gilles Celeux, Cathy Maugis-Rabusseau, Mohammed Sedki

Abstract

Several methods for variable selection have been proposed in model-based clustering and classification. These make use of backward or forward procedures to define the roles of the variables. Unfortunately, such stepwise procedures are slow and the resulting algorithms inefficient when analyzing large data sets with many variables. In this paper, we propose an alternative regularization approach for variable selection in model-based clustering and classification. In our approach the variables are first ranked using a lasso-like procedure in order to avoid slow stepwise algorithms. Thus, the variable selection methodology of Maugis et al. (Comput Stat Data Anal 53:3872–3882, 2009b) can be efficiently applied to high-dimensional data sets.
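As a rough illustration of the "rank first, select later" idea in the abstract, here is a minimal Python sketch. It is not the authors' procedure. It assumes a much simpler supervised setting (known class labels, standardized variables, unit variances) in which each class mean carries an l1 penalty. Under that assumption the penalized mean of class k for variable j is the soft-thresholded class average, so it becomes zero once the penalty reaches n_k * |class average|. Variables are then ranked by the penalty at which their last class mean vanishes. The function names `critical_penalties` and `rank_variables` and the toy data are hypothetical.

```python
from collections import defaultdict
from math import sqrt


def critical_penalties(rows, labels):
    """For each variable, return the smallest l1 penalty that shrinks
    every class mean to zero (simplified model, see assumptions above)."""
    n = len(rows)
    p = len(rows[0])
    penalties = []
    for j in range(p):
        column = [row[j] for row in rows]
        mean = sum(column) / n
        sd = sqrt(sum((v - mean) ** 2 for v in column) / n)
        if sd == 0.0:
            # A constant variable carries no class information.
            penalties.append(0.0)
            continue
        # Sum of standardized values per class; n_k * |class mean| = |class sum|.
        class_sums = defaultdict(float)
        for value, label in zip(column, labels):
            class_sums[label] += (value - mean) / sd
        penalties.append(max(abs(s) for s in class_sums.values()))
    return penalties


def rank_variables(rows, labels):
    """Order variable indices from most to least relevant: a variable that
    survives a larger penalty is ranked earlier."""
    penalties = critical_penalties(rows, labels)
    return sorted(range(len(penalties)), key=lambda j: penalties[j], reverse=True)


# Toy data (assumption): variable 0 separates the classes, variable 1 barely
# does, variable 2 is constant.
rows = [
    [0.0, 1.0, 3.0],
    [0.2, -1.0, 3.0],
    [5.0, 1.1, 3.0],
    [5.2, -0.9, 3.0],
]
labels = [0, 0, 1, 1]
ranking = rank_variables(rows, labels)
print(ranking)
```

A ranking of this kind would only replace the stepwise search over candidate variables. Deciding the role of each ranked variable, as in the methodology of Maugis et al. (2009b), is a separate step that this sketch does not attempt.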
Origin : Files produced by the author(s)

Dates and versions

hal-01053784, version 1 (01-08-2014)
hal-01053784, version 2 (28-11-2017)
hal-01053784, version 3 (17-04-2018)

Cite

Gilles Celeux, Cathy Maugis-Rabusseau, Mohammed Sedki. Variable selection in model-based clustering and discriminant analysis with a regularization approach. Advances in Data Analysis and Classification, 2018, ⟨10.1007/s11634-018-0322-5⟩. ⟨hal-01053784v3⟩