Conference papers

How to deal with missing data in supervised deep learning?

Niels Ipsen 1 Pierre-Alexandre Mattei 2, 3 Jes Frellsen 1
2 MAASAI - Modèles et algorithmes pour l’intelligence artificielle
CRISAM - Inria Sophia Antipolis - Méditerranée , UNS - Université Nice Sophia Antipolis (... - 2019), JAD - Laboratoire Jean Alexandre Dieudonné, Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems
Abstract : The issue of missing data in supervised learning has been largely overlooked, especially in the deep learning community. We investigate strategies to adapt neural architectures to handle missing values. Here, we focus on regression and classification problems where the features are assumed to be missing at random. Of particular interest are schemes that allow a neural discriminative architecture to be reused as-is. One scheme involves imputing the missing values with learnable constants. We propose a second, novel approach that leverages recent advances in deep generative modelling. More precisely, a deep latent variable model can be learned jointly with the discriminative model, end-to-end, using importance-weighted variational inference. This hybrid approach, which mimics multiple imputation, also allows the data to be imputed by relying on both the discriminative and the generative model. We also discuss ways of using a pre-trained generative model to train the discriminative one. In domains where powerful deep generative models are available, the hybrid approach leads to large performance gains.
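The first scheme mentioned in the abstract, imputing missing values with learnable constants, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a plain linear regressor in place of a neural network, a squared-error loss, and a binary mask where 1 marks an observed feature. All names (`impute`, `predict`, `sgd_step`, `params`) are hypothetical and chosen for illustration only.

```python
import math


def impute(x, mask, consts):
    """Replace each missing entry (mask == 0) with its learnable constant."""
    return [xj if mj else cj for xj, mj, cj in zip(x, mask, consts)]


def predict(params, x, mask):
    """Linear prediction on the imputed input (stand-in for a neural network)."""
    x_imp = impute(x, mask, params["c"])
    return sum(w * v for w, v in zip(params["w"], x_imp)) + params["b"]


def sgd_step(params, x, mask, y, lr=0.1):
    """One gradient step on 0.5 * (prediction - y)**2.

    The imputation constants "c" are updated together with the weights,
    but only the constants of features missing in this sample get a gradient.
    """
    x_imp = impute(x, mask, params["c"])
    err = predict(params, x, mask) - y
    return {
        "w": [w - lr * err * v for w, v in zip(params["w"], x_imp)],
        "b": params["b"] - lr * err,
        "c": [
            c - lr * err * w * (0 if m else 1)
            for c, w, m in zip(params["c"], params["w"], mask)
        ],
    }


# Toy sample (made up for illustration): the second feature is missing.
params = {"w": [1.0, 1.0], "b": 0.0, "c": [0.5, 0.5]}
x = [2.0, math.nan]
mask = [1, 0]
updated = sgd_step(params, x, mask, y=1.0)
```

In this sketch the discriminative model itself is untouched: it only ever sees a complete input vector, which is what makes it possible to reuse an existing architecture as-is.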
Contributor : Pierre-Alexandre Mattei
Submitted on : Monday, December 7, 2020 - 4:49:19 PM
Last modification on : Friday, January 21, 2022 - 3:09:54 AM
Long-term archiving on : Monday, March 8, 2021 - 7:26:39 PM


Files produced by the author(s)


  • HAL Id : hal-03044144, version 1



Niels Ipsen, Pierre-Alexandre Mattei, Jes Frellsen. How to deal with missing data in supervised deep learning? Artemiss - ICML Workshop on the Art of Learning with Missing Values, Jul 2020, Vienna, Austria. ⟨hal-03044144⟩


