Clustering of the values of a response variable and simultaneous covariate selection using a stepwise algorithm - Inria - Institut national de recherche en sciences et technologies du numérique Accéder directement au contenu
Article Dans Une Revue Applied Mathematics Année : 2016

Clustering of the values of a response variable and simultaneous covariate selection using a stepwise algorithm

Résumé

In supervised learning the number of values of a response variable can be very high. Grouping these values in a few clusters can be useful to perform accurate supervised classification analyses. On the other hand selecting relevant covariates is a crucial step to build robust and efficient prediction models. We propose in this paper an algorithm that simultaneously groups the values of a response variable into a limited number of clusters and selects stepwise the best covariates that discriminate this clustering. These objectives are achieved by alternate optimization of a user-defined model selection criterion. This process extends a former version of the algorithm to a more general framework. Moreover possible further developments are discussed in detail.
Fichier principal
Vignette du fichier
Clustering of the values of a response variable.AM.12.07.2016.pdf (315.23 Ko) Télécharger le fichier
Origine : Publication financée par une institution
Loading...

Dates et versions

hal-01395535 , version 1 (10-11-2016)

Identifiants

Citer

Olivier Collignon, Jean-Marie Monnez. Clustering of the values of a response variable and simultaneous covariate selection using a stepwise algorithm. Applied Mathematics, 2016, 7 (15), pp.1639-1648. ⟨10.4236/am.2016.715141⟩. ⟨hal-01395535⟩
152 Consultations
389 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More