Skip to Main content Skip to Navigation
Conference papers

How to deal with missing data in supervised deep learning?

Niels Ipsen 1 Pierre-Alexandre Mattei 2, 3 Jes Frellsen 1
2 MAASAI - Modèles et algorithmes pour l’intelligence artificielle
CRISAM - Inria Sophia Antipolis - Méditerranée , Laboratoire I3S - SPARKS - Scalable and Pervasive softwARe and Knowledge Systems, UNS - Université Nice Sophia Antipolis (... - 2019), JAD - Laboratoire Jean Alexandre Dieudonné
Abstract : The issue of missing data in supervised learning has been largely overlooked, especially in the deep learning community. We investigate strategies to adapt neural architectures to handle missing values. Here, we focus on regression and classification problems where the features are assumed to be missing at random. Of particular interest are schemes that allow to reuse as-is a neural discriminative architecture. One scheme involves imputing the missing values with learnable constants. We propose a second novel approach that leverages recent advances in deep generative modelling. More precisely, a deep latent variable model can be learned jointly with the discriminative model, using importance-weighted variational inference in an end-to-end way. This hybrid approach, which mimics multiple imputation, also allows to impute the data, by relying on both the discriminative and the generative model. We also discuss ways of using a pre-trained generative model to train the discriminative one. In domains where powerful deep generative models are available, the hybrid approach leads to large performance gains.
Complete list of metadatas

https://hal.inria.fr/hal-03044144
Contributor : Pierre-Alexandre Mattei <>
Submitted on : Monday, December 7, 2020 - 4:49:19 PM
Last modification on : Wednesday, December 9, 2020 - 3:40:32 AM

File

how_to_deal_with_missing_data_...
Files produced by the author(s)

Identifiers

  • HAL Id : hal-03044144, version 1

Collections

Citation

Niels Ipsen, Pierre-Alexandre Mattei, Jes Frellsen. How to deal with missing data in supervised deep learning?. ICML Workshop on the Art of Learning with Missing Values (Artemiss), Jul 2020, Vienne, Austria. ⟨hal-03044144⟩

Share

Metrics

Record views

15

Files downloads

40