High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables - Inria - Institut national de recherche en sciences et technologies du numérique Access content directly
Journal Articles Statistics and Computing Year : 2015

High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables

Abstract

In this work we address the problem of approximating high-dimensional data with a low-dimensional representation. We make the following contributions. We propose an inverse regression method which exchanges the roles of input and response, such that the low-dimensional variable becomes the regressor, and which is tractable. We introduce a mixture of locally-linear probabilistic mapping model that starts with estimating the parameters of inverse regression, and follows with inferring closed-form solutions for the forward parameters of the high-dimensional regression problem of interest. Moreover, we introduce a partially-latent paradigm, such that the vector-valued response variable is composed of both observed and latent entries, thus being able to deal with data contaminated by experimental artifacts that cannot be explained with noise models. The proposed probabilistic formulation could be viewed as a latent-variable augmentation of regression. We devise expectation-maximization (EM) procedures based on a data augmentation strategy which facilitates the maximum-likelihood search over the model parameters. We propose two augmentation schemes and we describe in detail the associated EM inference procedures that may well be viewed as generalizations of a number of EM regression, dimension reduction, and factor analysis algorithms. The proposed framework is validated with both synthetic and real data. We provide experimental evidence that our method outperforms several existing regression techniques.
Fichier principal
Vignette du fichier
submission_rev.pdf (419.14 Ko) Télécharger le fichier
SupMat.pdf (597.2 Ko) Télécharger le fichier
Origin : Files produced by the author(s)
Format : Other
Loading...

Dates and versions

hal-00863468 , version 2 (09-01-2014)
hal-00863468 , version 3 (18-03-2014)

Identifiers

Cite

Antoine Deleforge, Florence Forbes, Radu Horaud. High-Dimensional Regression with Gaussian Mixtures and Partially-Latent Response Variables. Statistics and Computing, 2015, 25 (5), pp.893-911. ⟨10.1007/s11222-014-9461-5⟩. ⟨hal-00863468v3⟩
700 View
1104 Download

Altmetric

Share

Gmail Facebook X LinkedIn More