Predicting Future Instance Segmentation by Forecasting Convolutional Features

Abstract : Anticipating future events is an important prerequisite towards intelligent behavior. Video forecasting has been studied as a proxy task towards this goal. Recent work has shown that to predict semantic segmentation of future frames, forecasting at the semantic level is more effective than forecasting RGB frames and then segmenting these. In this paper we consider the more challenging problem of future instance segmentation, which additionally segments out individual objects. To deal with a varying number of output labels per image, we develop a predictive model in the space of fixed-sized convolutional features of the Mask R-CNN instance segmentation model. We apply the "detection head" of Mask R-CNN on the predicted features to produce the instance segmentation of future frames. Experiments show that this approach significantly improves over strong baselines based on optical flow and repurposed instance segmentation architectures.
Type de document :
Communication dans un congrès
ECCV 2018 - European Conference on Computer Vision, Sep 2018, Munich, Germany. pp.1-21
Liste complète des métadonnées

https://hal.inria.fr/hal-01757669
Contributeur : Pauline Luc <>
Soumis le : mercredi 3 octobre 2018 - 12:19:52
Dernière modification le : samedi 6 octobre 2018 - 01:08:32

Fichier

luc18eccv_arxiv.pdf
Fichiers produits par l'(les) auteur(s)

Identifiants

  • HAL Id : hal-01757669, version 2

Collections

Citation

Pauline Luc, Camille Couprie, Yann Lecun, Jakob Verbeek. Predicting Future Instance Segmentation by Forecasting Convolutional Features. ECCV 2018 - European Conference on Computer Vision, Sep 2018, Munich, Germany. pp.1-21. 〈hal-01757669v2〉

Partager

Métriques

Consultations de la notice

191

Téléchargements de fichiers

233